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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 209: 

TATTTATATT TTTTTATCTC TGGCATACTT TGATACCTTT TTAGACTTAA AGTCTTTAAT 60 

AGTGCCTTTC CACCTCTTTT TATCTATAAA GATTCTCCTA CATCATAATT CATTTTTTTA 12 0 

TTTAAACCTT TCTGTCTTAG TTTGTCTTTA TCTTCTTCAT ACCATTTTAA GATTGTCACA 180 

TAGTGGTTTT GATAGGTCTT ACCACTGCTT TCCATGTATC TGGATAGTTT ATTTATCATT 24 0 

ATATCTGTGT GTGAGTTTAA TTTTTCTTTT AGATTTTTAT ATTCTTCTTT GCTTAACCTT 300 

ACATTTTTGA ATTCTCCATA AAAAATGGGG GTGGACTTTT TATCTATCTC TCCCTCTCTC 360 

TCTTTATCTA TCTCTATATC TTTCCATGTA ATTCCAATCT GGAGTACCTC TACTGTCTAT 420 

CGGTAATTTA ATTTTGATAT CTGGCAATAC TGTGCTAGAT ATTTGATCTT TATATTCAGT 480 

ATTTTTTAAA GCTTGCCTAA TAATTGAAGT TAAATAGAAT GCTACTTCTT TATTCAATTC 540 

TTTATTTTTT AATTTTAAAC AATGAATTTT CATATCTAGG CTTGCTTTAT ATTTATGATA 600 

AAAGACTGCT CCTAAAAATG AAACAGATAT AAAATTTTCA AAAACTCTAT AATTTTTATC 660 

ATCTATATCT TCGTAGTAAC CTAAGATACC ATTGTCAATA TTTGTAGCAC TAATTCTAGG 720 

AGTTTTTCCA TCGAGTAAAT ATCTTTTTGG AATAGATGAG CCTGTTGGTA CTTAACTCGA 780 

TTTCCCCTTT TTTTCGGTAA TAAATATTTC TTTTTATTTT GTTGTCTGAT ATTTTTCCTA 840 

CCTGTCCTTT GTAGGATGAG TATTTTCTAG ATTTTCyTGA ATAACTTTTT ACTTGAAGTT 900 

TTAGCTTTTG AACTAGTCGT TGTACTTTCT TTTTGTTTAT TATCAGTCCT GATCTTTTTA 960 

ATATTGCTGT TATTCTCTAT ATCCTATTTT TCATTCATGA TATTCTTTTA CTAATTTTAT 1020 

CTTAAATTCT GTGCTGTATT TGCCATTAAA AAACTGACCT CCTTTAGTTA GTTTTTTGGC 1080 

CTAACTTTTG AGGGTCAGTT CAAAATTTGC GACTTTTAAA TGAATTCCAA TATTCAATTA 1140 

TTAAGAGTTA ACATGGTGCT TGCCAATAGG AATCATTAGA GGCGAATTGG AAATAGGGTC 1200 

ACGTATAATT TTTGCTTCAA GATTAAAGAT ATCTTTAACT AGTTTATCAT TTAGTATATC 1260 

TTCAGGCTTT CCCTCTGCAA CAAGTTTACC TTCTTTAATT GCAAATAGGT AATCAGCGTA 132 0 

TCTTGCTGTT AGATTTATAT CGTGCAAAAT CATGCAAATG GTTGTCTTAT ATTTTTGGTT 13 80 

TAGATCAGTC AAGAGGTCTA ATAGTTCTAT TTGATATGAG ATATCCAAGT AAGTAGTTGG 1440 

CTCATCTAAA AGTAGGATAC TTGTATCTTG GGCTAGGGCT AGAGCTATCC ATACTCTTTG 1500 

CCTTTGACCC CCAGAAAGTT CTTCAACTAG GTTATTTGCT AGATCTTCAA CATTGGCCTT 1560 

AACCATTGAT CTGTTTATTA TTTCAAGGTC ATCTTTTCCA AGACTCTTAA AAGGCTTTCT 1620 

GTAGGGGAAA CGACCACGGC TTACAAGATC AGCTACTGTT ATTGATTCAG GGATTATTGG 1680 

AGATTGAGGT AATATAGCTA TGTGTTTTGC TAAATCTTTT TCTTTATAAG AATTAATTGA 1740 
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TTTATTATCA AGCAATACTT CTCCCTCTAA TGGCTTTATA AGTCGAGACA AGGTTTTAAT 1800 

GAGTGTTGAT TTCCCACAAC CATTTGACCC AATAATAACT GATATTTTTT CTTCAGGTAT 1860 

TTTTATATTT ATATTTTCCA AGATTATTTT TTCATCATAA CCGCAGGTAA GATTATTTGA 192 0 

CCACAGACCT TTCATTATAT ATTCCTCCTG TTCATTTTTA TTAGTAAGTA TATTAAGTAT 1980 

GGTGAACCTA ACAAGCCAGT TACAACACCT ACTGGATATC TAGCTGGTAA AATATTTTGA 2 040 

GAGAATATGT CTGATAACAA AACTAGTAAA ATTCCAACCA ATCCAGCTAA TATTGGGCTT 2100 

CTTTTCTTGC CAATATTTAA GGCTATGGGA CCAGCTAAAA AAGATATACA AGCTATTGGT 2160 

CCTGTAATTG AAGTAGAAAA AGCAGTTAAA GATACAGCGC AAAAAATTAA AACAAGCCTT 2 22 0 

GAAAGCTCGG GATTTGCTCC AAGTCCGATT GCTATTTCTT CACCAAGTTC AATAATTTCT 2280 

AGTCTTTTAT TAAAAAATAA AACTAATATA GTAGCAATAA TACTTACTAT TAGAACAAGA 2 340 

GGTATGTCAT CTAACTTTGT AAAAGATAAA GAGCCACTGA GCCATCTCAT AACTTCTTGT 24 00 

AATTCATATC TTGCTACTTT CAACAATAAA AATGAGGTGC CTGCTCTTGT GACAGCTTGA 2460 

AAACCAATAC CTAATATTAT CAGTCTTGCT GCTGAAAAAC CATCTTTTTT AGCTAGTAAA 2 520 

AATAATATTA AAGATGATGT TAGTCCACAA GTTATTGAAA TAATTCCAGT AGTTAAACTA 25 80 

TTTGTTTTTA ATACCAATAT GCAAAAGACC GCTGCAATAG ATGAAGAACT TGTGACACCG 2 640 

ATTATATCAG GACTTGCAAG AGGATTTCTT AACATAGTTT GAAAGATAAA TCCTGCCAAT 27 00 

CCAAAAGACC AGCCAGCTAT AATTCCTGCT AATAATTTTG GTAATCTAAT TTCCATAATC 27 60 

GAAAAACTAG CTCCAGGAAC AGTTTCACTA TTTAAGACTT TAATCAAAGT TGAAAAAGAA 2 820 

TAACTTTCAT CTCCGATAAG TAAAATGAAA AATGATAGAC TGATTATTAT TAATAAAAAT 2 8 80 

AGTGAGGAAA ATAGTGTTAT TCTATTTTTT CTTTTTTGAA TACCTATAAT TAAATTTTGC 2 940 

ATTAGTTATT AACCCCTCTA TTTTTCATAG TTACATAAAT AAGTACTGGA CCCCCGATTA 3000 

TTGCAGTAAT TATCCCTACT TCAATTTCAC CTGGTTTACC TAACATACGG CCGATTATAT 3 0 60 

CACATATAAG CAAGAGCTCT GCACCTATAA AAGATGAAGA AATGGTCATT GTGCGTATAT 312 0 

CTTTGCTTAT AAATAAGCCA CAAAAGTGAG GAACTATAAG ACCTACGAAG CCAATAGGTC 3180 

CACCAATTGC AGTAATACTT GAACATAAAA GCACACTTGC AATTATTGCA AGTGATCTTA 3240 

TCCTATTAAC ATTAACTCCA AGACCAACAG CCATTTCATC ACCCATAGcT AAAGCGTTTA 3 3 00 

AATCTGATGA AATAAATATA GCTATCAAGT GACCTAAAAT TATAAAAGGT AGTAGTGTAG 33 60 

ATATAGAAGA TAATGTAGCT GCTCCAAGGC TACCTATTTG CCAAAATCTA AATTTGTCTA 3 420 

AGACGTTATT ATTCGGTAAA ATTAAAAAAC TTACAAAACT GCTTAAAGCC ATACTAACAC 3480 
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AAGTTCCTGA TAAGGCAAGT TTTATAGGGG TAAGGCCTGC TTTTCCGTTA CAGCAATCGC 3540 

GTATACAAAA ATTGCACTTA CTAAGCCACC AATGATTGCG 3580 
(2) INFORMATION FOR SEQ ID NO: 210: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 11378 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 210: 

CCAAATTGCT CCACAATTAT TATGGAGTCG TCGTTTGGCA GATGGGCGTG ATATGTGTGC 60 

TCAAGAATGG TTGACAGGCA AGATATTGAC CCCCTATGAT ATGAATCGTA AGCAAATCGT 12 0 

CAATATTTTA ACCCGTCTTC ATCGCTCACG TCCGTTGATG ACACAATTGA GTCGTTTGGG 180 

CTATGCCATG GAAACACCTG TAGATTTACT ACAGTCTTGG CAGGAAACGG CTCCAGATGC 24 0 

TTTGCGTAAA AATCATTTTA TCAGTGAAGT GATGGCTGAT TTACGTCAGA CTATTCCAGG 300 

ATTTAGAGAG GACCATGCGA CCATTGTCCA TGGAGATGTA CGACATAGTA ATTGGATTGA 3 60 

GACAGATAGT GGCTTGATTT ATTTAGTAGA TTGGGATTCG GTTCGCTTGA CCGATCGCAT 42 0 

GTTTGATGTG GCCCATATGC TCTGCCATTA TATTTCAGAA CATCAGTGGA AGGAATGGTT 480 

GACCTACTAC GGTTACAAGT ACAATCAAAC GGTATTAAGT AAATTGTATT GGTATGGTCA 540 

ATTGTCTTAT TTGAGTCAGA TTTCCAAGTA TTATATGAAC CAAGATTTAG AAAATGTCAA 6 00 

TCGGGAGATT CATGGTTTGC GTCATTTCCG AGACAAGTAT GGAAAGAGAA GATGAGAGTT 660 

AGAAATCGTA AAGGGGCAAC AGAATTACTA GAGGCAAATC CCCAGTATGT GGTCCTCAAT 720 

CCCTTGGAAG CCAAGGCAAA ATGGCGGGAC TTGTT^GGCA A ['! 'ATAATCC CATTCATGTG 780 

GAAGTTGGAA GTGGAAAGGG TGCCTTTGTT TCAGGTATGG CCAAGCAAAA CCCTGACATC 84 0 

AAC T AT AT C G GGATTGATAT TCAAAAGTCT GTTTTGAGCT ACGCTTTGGA CAAGGTGCTT 900 

GAAGTTGGAG TGCCTAACAT CAAGCTCTTG TGGGTAGATG GTTCTGACTT AACTGACTAC 960 

TTTGAAGACG GTGAGATTGA TCGCTTGTAT CTGAACTTTT CAGATCCATG GCCGAAAAAA 102 0 

CGCCATGAAA AGCGTCGTTT GACCTACAAG ACCTTCTTGG ATACCTTCAA ACGTATCTTG 108 0 

CCTGAAAATG GAGAAATTCA TTTCAAGACG GATAACCGTG GCTTGTTTGA GTACAGTTTA 1140 

GTGAGCTTTT CTCAATATGG CATGAAACTC AATGGTGTCT GGTTAGATTT GCATGCCAGT 12 00 

GATTTTGAAG GCAATGTCAT GACAGAATAC GAGCAAAAAT TCTCAAACAA GGGGCAAGTT 12 60 

ATCTACCGAG TTGAGGCAGA ATTTTAAGAG ATAACCTAAA ATTAGGCTGT ACAAGTGCTT 132 0 
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TTGCTTTACA TAAGTTGGCA AACGTGCTAT ACTGATAGTA AGAATATGAA AAGTGAGGCG 13 8 0 

GGGAAATATC TTCGCCTCTT GCTTATGAGG AGGTGGACGC AATCGCAACA ATCGTAGAAT 144 0 

TAGTCAGAGA AGTTGTAGAA CCTGTCATAG AAGCTCCTTT TGAACTCGTG GATATCGAGT 1500 

ATGGAAAGAT TGGCAGTGAC ATGATTCTCA GTATTTTTGT AGATAAACCC GAAGAATTAC 15 60 

CTTGAACGAC ACGGCAGACT TGACAGAAAT TATCAGTCCT GTCCTAGACA CCATCAAGCC 162 0 

AGATCCCTTC CCAGAACAAT ATTTCCTAGA AATTACCAGT CCAGGTTTGG AACGTCCTTT 168 0 

GAAAACCAAG GATGCCGTCG CTGGAGCGGT TGGAAAATAC ATCCATGTCG GGCTCTACCA 1740 

AGCCATCGAT AAGCAAAAGG TCTTTGAAGG AACCTTGTTG GCCTTCGAAG AGGACGAGTT 1800 

GACTATGGAA TATATGGACA AGACGCGTAA GAAAACCGTC CAAATTCCAT ACAGTTTAGT 1860 

ATCAAAAGCA CGTTTAGCAG TTAAATTATA GAAAAAGAAA GGATAGCTTT TGAGGATTCA 192 0 

AAAGTGAAGA AAACATGAGT AAAGAAATGC TAGAGGCCTT CCGCATTTTG GAAGAAGACA 198 0 

AGGGAATCAA AAAAGAAGAT ATCATCGACG CAGTAGTAGA GTCCCTTCGT TCCGCTTATC 2040 

GCAGACGCTA TGGTCAGTCA GACAGCGTAG CTATTGACTT CAACGAAAAA ACAGGTGACT 2100 

TTACAGTTTA TACTGTCCGT GAAGTTGTTG ATGAAGTATT TGATAGCCGT TTGGAAATCA 2160 

GCTTGAAAGA TGCTCTTGCC ATTAATTCAG CTTATGAACT TGGAGACAAA ATCAAGTTTG 2220 

AAGAAGCACC AGCTGAGTTT GGTCGTGTAG CAGCCCAATC TGCCAAACAA ACCATCATGG 2 2 80 

AAAAAATGCG CAAgCAAACA CGTGCCATCA CTTACAATAC TTACAAAGAA CATGAGCAAG 2340 

AAATCATGTC TGGTACAGTA GAACGCTTTG ACAACCGCTT TATCTATGTC AACCTTGGTA 2400 

GCATCGAAGC CCAATTGTCA AAACAAGACC AAATTCCTGG AGAAGTTTTT GCTTCTCATG 2460 

ATCGTATCGA AGTTTATGTT TACAAGGTTG AAGACAACCC TCGTGGTGTG AACGTCTTTG 2 520 

TTAGCCGTAG TCATCCAGAA ATGATCAAAC GTTTAATGGA GCAAGAAATT CCAGAAGTTT 2 580 

ATGATGGAAC TGTTGAAATC ATGAGCGTGG CTCGTGAAGC AGGTGACCGT ACGAAGGTTG 2 640 

CTGTTCGTAG CCACAATCCA AACGTGGATG CTATCGGTAC AATCGTTGGA CGTGGTGGTG 2 700 

CTAATATCAA GAAGATTACT AGCAAATTCC ACCCAGCTCG TTACGATGCT AAAAATGACC 2760 

GCATGGTACC AATCGAAGAA AATATCGATG TTATCGAGTG GGTAGCAGAT CCAGCTGAAT 2820 

TTATCTACAA TGCCATCGCT CCTGCTGAGG TTGACCAAGT TATCTTTGAT GAAAACGACA 2 8 80 

GCAAACGTGC CTTGGTGGTT GTTCCAGATA ACAAGCTTTC TCTTGCCATT GGTCGTCGTG 2 940 

GACAAAACGT GCGCTTGGCG GCTCACTTGA CTGGTTACCG TATCGATATC AAGTCTGCTA 3 000 

GCGAATTTGA AGCCATGGAA GACGCTGCTT CAGTAGAGTT GGAAGTAGAA AACGATACTG 3060 
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TAGAAGAATA AAAGCTGCTA GAGGAGGGAA AGATGAAAAC AAGAAAAATC CCTTTGCGCA 312 0 

AGTCTGTTGT GTCTAACGAA GTGATTGATA AGCGTGATTT GCTCCGCATT GTCAAGAACA 3180 

AGGAAGGACA AGTCTTTATT GATcCTACGG GCAAGGCCAA TGGCCGCGGC GCTTATATCA 3240 

AACTAGACAA TGCAGAAGCC CTAGAGGCGA AAAAGAAGAA GGTCTTTAAC CGCAGCTTTA 3300 

GCATGGAAGT GGAAGAAAGC TTTTATGACG AGTTGATCGC TTATGTGGAT CACAAAGTGA 3360 

AAAGAAGAGA GTTGGGACTT GAATAAGCAA AAGATAAGTA ATCTCTTGGG GCTTGCTCAG 342 0 

CGAGCAGGGC GCATCATATC GGGTGAAGAA TTGGTGGTCA AGGCCATTCA AGACGGCAAG 34 80 

GCCAAGTTGG TCTTTCTAGC TCATGATGCT GGACCCAATC TGACCAAGAA GATTCAAGAT 3540 

AAAAGTCATT ATTATCAAGT AGAAATTGTA ACCGTGTTTT CAACACTGGA ATTAAGCATA 3600 

GCAGTCGGGA AATCGAGAAA GGTTTTGGCT GTAACAGATG CTGGATTTAC AAAGAAAATG 3660 

AGGTCTCTTA TGGAATAGAA GAGGAGGACA TGATTTGTCT AAGAAAAGAT TGTACGAAAT 3 720 

CGCAAAAGAA CTTGGAAAAG AAAGTAAAGA AGTTGTAGCG CGTGCAAAAG AGTTGGGCTT 37 80 

GGATGTGAAA AGCCACTCAT CAAGTGTGGA AGAAGCTGTC GCTGCAAAAA TTGCTGCCAG 3 840 

CTTTAAGCCT GCAGCTGCTC CGAAAGTAGA AGCAAAACCT GCAGCCCCAA AAGTAAGTGC 3 900 

AGAAAAGAAA GCCGAAAAAT CTGAGCCAGC TAAACCAGCT GTAGCTAAGG AAGAGGCAAA 3 960 

ACCTGCAGCC CCAAAAGCAA GTGCAGAAAA GAAAGCCGAA AAGTCTGAAC CAGTAAAACC 4 020 

AGCTGTAGCC AAGGAAGAGG CAAAACCAGC TGAGCCAGTC ACTCCGAAAA CAGAAAAAGT 4 080 

AGCGGCTAAA CCGCAAAGTC GTAATTTCAA GGCTGAGCGT GAAGCACGTG CTAAAGAGCA 4140 

GGCAGAGCGA CGCAAGCAAA ATAAGGGCAA TAACCGTGAC CAACAACAAA ACGGAAACCG 42 00 

TCAGAAAAAC GACGGCCGTA ATGGTGGAAA ACAAGGTCAA AGCAACCGCG ACAATCGTCG 4260 

CTTTAATGAC CAAGCTAAGA AGCAGCAAGG TCAGCAAAAA CGTAGAAATG AGCGCCGTCA 4 320 

GCAAGAGGAT AAACGTTCAA ATCAAGCGGC TCCACGTATT GACTTTAAAG CCCGTGCAGC 43 80 

AGCCCTAAAA GCAGAGCAAA ATGCAGAGTA CGCTCGTTCA AGTGAGGAAC GCTTCAAGCA 4440 

GTATCAGGCT GCTAAAGAAG CCTTGGCTCA AGCTAACAAA CGCAAGGAAC CAGAGGAAAT 4 500 

CTTTGAAGAA GCGGCTAAGT TAGCTGAACA AGCACAGCAA GTTCAAGCAG TGGTTGAAGT 4560 

CGTCCCTGAG AAAAAAGAAC CTGCAGTGGA TACACGTCGT AAAAAACAAG CTCGACCAGA 4 620 

CAAAAATCGT GACGATTATG ATCATGAAGA AGATGGTCCT AGAAAACAAC AAAAGAATCG 4 6 80 

AAGTAGTCAA AATCAAGTGA GAAATCAAAA GAATAGTAAC TGGAATAACA ACAAAAAGAA 4740 

CAAAAAAGGC AATAACAAGA ACAACCGTAA TCAGACTCCA AAACCTGTTA CGGAGCGTAA 4 800 

ATTCCATGAA TTGCCAACAG AATT T G AAT A TACAGATGGT ATGACCGTTG CGGAAATCGC 4 8 60 
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AAAACGTATC AAACGTGAAC 
GGCCACACAA AACCAATCCT 
TATCGAAGCC AAACAAAAGG 
AGATGGTTAT CTCAATGAAG 
ACACGTTGAC CACGGTAAAA 
AGGTGAAGCA GGTGGTATTA 
GAAGATTACC TTCCTTGATA 
TGCTTCTGTT ACCGATATTA 
GACTATTGAA GCCATCAACC 
CAAGATTGAT AAACCAGGTG 
TGTGATGTCA ACTGCTTGGG 
CCAAAATATC GAAGAATTGT 
AGCAGACCCA ACAGTTCGTG 
AGGTGCGGTC GCAACCCTTC 
TGTCGGAAAT ACcTTCGGTC 
AGTTGCTGGA CCATCAACAC 
TGACCACTTT GCCGTTTACG 
CAAACGTGCC CTCATGAAAC 
TGATACCCTT AAAGCTGGGG 
AGGTTCTGTT GAAGCCCTTT 
GACTATCGTC CACTCAGCGG 
TTCAAATGCC TTTATCGTTG 
AGAAGCTGAC GATGTGGAAA 
GGAAGAAGCT ATGAAAGGGA 
GGTTATCCGT GAAACCTTCA 
CAACGGTAAG GTTGCCCGTG 
TGATGGTGAA CTCGCAAGCT 
TCGTGAAGGT GGATTGATGA 
GGCGTATGTC ATGGAAGAAA 



1203 

CAGCTGAAAT TGTTAAGAAA CTTTTCATGA TGGGTGTCAT 4920 

TGGATGGGGA AACAATTGAA CTCCTCATGG TGGATTACGG 4980 

TTGAAGTGGA TAATGCTGAC ATCGAACGTT TCTTTGTCGA 504 0 

ATGAATTGGT TGAGCGTCCA CCAGTTGTTA CTATCATGGG 5100 

CAACCCTTTT GGATACTCTT CGTAACTCAC GTGTTGCGAC 5160 

CTCAGCATAT CGGTGCCTAC CAAATCGTGG AAAATGGTAA 5220 

CACCAGGACA CGCGGCCTTT ACATCAATGC GTGCGCGTGG 52 80 

CGATCTTGGT CGTAGCGGCA GATGACGGGG TTATGCCTCA 5340 

ACTCAAAAGC AGCTAACGTT CCAATCATCG TAGCTATTAA 5400 

CTAACCCAGA ACGCGTTATC GGTGAATTGG CAGAGCATGG 5460 

GTGGAGATTC TGAATTTGTT GAAATTTCGG CTAAATTCAA 552 0 

TGGAAACAGT CCTTCTTGTG GCTGAAATCC AAGAACTCAA 5580 

CGATCGGTAC GGTTATCGAA GCGCGCTTGG ATAAAGGAAA 5640 

TTGTACAACA AGGTACCTTG AATGTTCAAG ACCCAATCGT 5700 

GTGTCCGTGC TATGACCAAC GACCTTGGTC GTCGTGTTAA 57 60 

CAGTCTCTAT CACAGGTTTG AACGAAGCAC CGATGGCGGG 5820 

AGGATGAAAA ATCTGCGCGT GCAGCAGGTG AAGAGCGTGC 5880 

AACGTCAAGC TACCCAACGT GTTAGCCTTG AAAACCTCTT 5940 

AACTCAAATC TGTTAATGTT ATCATCAAGG CTGATGTACA 6000 

CTGCCTCACT TCAAAAGATT GACGTGGAAG GTGTCAAAGT 6060 

TCGGTGCTAT CAACGAATCA GACGTGACCC TTGCCGAAGC 5120 

GTTTCAACGT ACGCCCTACA CCACAAGCTC GTCAACAAGC 6180 

TCCGTCTTCA CAGCATTATC TACAAGGTTA TCGAAGAGAT 6240 

TGCTTGATCC AGAATTTGAA GAAAAAGTTA TTGGTGAAGC 6300 

AGGTGTCTAA AGTGGGAACT ATCGGTGGAT TTATGGTTAT 6360 

ACTCTAAAGT CCGTGTTATC CGTGATGGTG TCGTTATCTA 6420 

TGAAACACTA TAAAGACGAC GTGAAAGAAG TGACAAACGG 6480 

TCGACGGCTA CAATGATATT AAGATGGATG ATGTGATTGA 6540 

T C AAGAG ATA AGATTTTTTG CTCCTTTCTT AGGTGGTGAG 6600 
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GGACGCAAGC AAACCGATGG TTTCATTGCT TATTTTTGAG CCTAGGGTCT CAAAAATCCC 6660 

CTGTGATGGG ACTGATAAAT CAGTTCCATC ACTTTCACCA CGGCGAAAGA AGCAGATGAC 672 0 

TTCAAATTGA ACTTCGTTTC AATTTAAACT GAAAATCAAG AAGTTTAAAA TAGCTAGGTC 67 80 

TGCTGGCCTA GCTTTTGGTT CAAAGTAGAG AAAGGAATAT CATGGCAAAT CATTTCCGTA 6840 

CAGATCGTGT GGGCATGGAA ATCAAGCGTG AAGTCAATGA GATTTTGCAA AAGAAAGTCC 6900 

GTGATCCACG TGTCCAAGGT GTGACCATCA TAGATGTTCA GATGCTGGGT GACTTGTCTG 69 60 

TTGCCAAGGT TTATTACACC ATTTTGAGTA ACCTTGCTTC GGATAACCAA AAAGCCCAAA 702 0 

TCGGGCTTGA AAAAGCAACT GGTACCATCA AACGTGAACT TGGTCGCAAT TTGAAATTGT 7080 

ACAAAATCCC AGATTTGACC TTCGTCAAAG ACGAGTCCAT CGAGTATGGA AACAAGATTG 7140 

ACGAGATGCT ACGCAATCTG GATAAGAACT AAAGAAGAGG GGTTGCCCCT CTTTTTTGGT 7200 

GGAGGAAAAT AGGTTGAATT TGAAATGGAA AAATATTCTT TTATAATAGA TTGAAACTAG 726 0 

AATAGTACGC CTCTACTTCT AAAATATTGT TAGAAATCGA TTTGACTGTC CTGATCGATT 732 0 

TGTCCTGTTC TTGTTTCATT TTAATATAAA AAAGGGATTC TGTATTTTTT AATGTTATCT 73 80 

AATTAGAAAA TGCTTTTTTT GTAGGAAATA TAATATGATA AGGTGCAAAA AAGAAATAAG 744 0 

GAGTTTGTAT ATGGCTGAAC AAGACTTAGC TATGCAAGTA TTGCAACAAG TGGTGAAACT 7500 

ACCTGTTGTT AAGGTTGATC GTTCGAAATT TTTAGTGGAT AAGTTTTCCA AAGAATTGGA 7560 

TCCAAAAGAT ATTCCTACCT TATTGGAACA AGGTCCAACG ACTCTTCTAT CTCAAGAAAT 7 620 

ATTAGATCGT GTAGCTAATG CTTGTATTCG GGACAATGTA TTATTAGCGA GTGGGACTTC 7 680 

TGTTTTGGCA GGATTACCTG GAGGGCTTGC TATGGCAATT ACCATTCCAG CTGATGTGGC 7740 

TCAATTTTAT GCTTTCTCTC TGAAATTGGC TCAAGAATTA GGTTATATTT ATGGTTATGA 7800 

GGATCTTTGG GCTTCACGAG AGGAGTTGAG TGAAGATGCT CAAAATACCC TCTTGCTTTA 7860 

TCTAGGCGTA ATGTTAGGGG TGAATGGAAC CGCTGCTTTG CTACGTGTTG GTAGTATAAC 7920 

AATTGCCAAA CAGGTAATGA AAATAGTGCC TAATAAAGCT TTAACAAAGA CGCTTTGGTA 7980 

CCCTATTTTG AAAAAAGTCT TAAAAATATT TGGTGTGAAT CTTACCAAGG GAGGGTTGGC 8040 

CAAAGGAATG GGGAAATTTA TTCCTATCTT GGGTGGTATC ATTTCAGGTG GTTTAACCTT 8100 

TGCAACTATG AAACCAATGG GGGAAAGCTT GCAGAAAGAA TTATCCAAGC TAGTCAACTA 8160 

TAGTGAAGTT CAATATCAAG AAGATGTTGA AACAATCCGA AAAGAGGCTG AAATCATCAA 8220 

AGGAGAGTAA TATGAATCCT ATCAAAGCTT TTGCTAAAAT TTATGGTAAT TACTTTTTGA 8280 

CCGTGCAAGG TGTAAAAGTG ATGAAAACGA TAAAGAAAGC TGACCATGTC GTTGTTGGTC 8340 

TGGGGAAACT TTTTATTGCC GACAAGTTAA TGGATACGGC TCGGTGGCTC ATTAAGCCAG 8400 
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AGGAGAGAGA ATGAAATTTT TTTGGTCTTC TTGCTATTCT TTTTATCAAA CCGATTATTG 84 60 

GGATTGTGAA ATTCTTTTGG ATGATCATCT CTTTTGCAGT CCAATTGCTG TTTTACAAGA 8520 

TAGTGTTTAA GATATTGGAT TGGCTCTTTA AACTTATCTA GATGGTAATC CAAGTTGCAG 8580 

AGAACTAGCA GGAACTCCAC TGCTAGTTTT TTATTCTCTT TCCATATGGT ATAATATAAG 8 640 

CAGTAAAATC ATTTTATACT CTTCGAAAAT CTCTTCAAAC CACGTCAGCT TCACCTTGCA 8700 

GTATATATGT TACTGACTTC GTCAGTTCTA TCCACAACCT CAAAACGGTG TTTTGAGCTG 87 60 

ACTTCGTCAG TTCTATCTAC AACCTCAAAA CACTGTTTTG AGCAACCTGC GGCTAGCTTC 8820 

CTAGTTTGCT CTTTGATTTT CATTGAGTAT TAGAACATAC AATGGAGGTC GTCATGGACA 8880 

ATATCATCGA TGTGTCAATT CCTGTTGCAG AAGTGGTGGA CAAGCATCCA GAAGTCTTGG 894 0 

AAATTCTAGT GGAGTTGGGT TTTAAACCCC TTGCCAATCC CTTAATGCGC AATACAGTTG 9000 

GTCGTAAAGT ATCACTTAAA CAGGGTTCTA AGCTAGCAGG AACTCCTATG GACAAGATTG 9060 

TACGCACACT GGAAGCGAAT GGCTACGAAG TGATTGGATT AGACTAATGA CAGATGAACG 912 0 

GATTCATATC CTACGGGATA TTTTGTTAGA ATTGCACAAT GGCGCCTCTC CTGAGTCGGT 918 0 

TCAAGATCGC TTTGATGCGA CCTTTACGGG CGTGTCAGCC ATCGAGATTT CCCTTATGGA 924 0 

GCACGAGCTG ATGAACTCGG ATTCGGGCGT CACTTTTGAA GATGTTATGG AACTCTGTGA 93 00 

TGTCCATGCC AATCTTTTTA AAAATGCTAT CAAAGGTGTC GAAGTTTCAG ATACTGAGCA 93 60 

TCCAGGTCAC CCAGTTCGTG TCTTCAAAGA AGAAAATCTG GCTCTCCGTG CGGCCTTGAT 9420 

TCGCATTCGT AGATTGTTAG ATACCTATGA GTCTATGGAA GACGAGGAAA TGCTGGCGGA 94 80 

GATGCGTAAG GGTTTGGTGC GTCAGATGGG ACTTGTGGGT CAATTTGACA TCCATTACCA 9540 

ACGTAAGGAA GAACTCTTCT TTCCTATCAT GGAGCGCTAT GGACACGATT CACCTCCCAA 9 60 0 

AGTTATGTGG GGAGTGGATG ATCAGATTAG GGAACTCTTT CAAACAGCTC TAACGACAGC 9660 

CAAGTCACTA CCAGAAGTGT CAATTAGCAG TGTAAAGGAA GCTTTTGAAG CTTTTGCG^C 972 0 

AGAGTTTGAA AGTATGATTT TCAAGGAAGA GTCCATCCTC CTCATGATTC TCCTTGAGTC 97 8 0 

TTTTACTCAG GATGACTGGC TTCAGATTGC GGAGGAGAGC GATGCCTATG GCTATGCCAT 984 0 

CATCCGTCCG TCAGAGAAAT GGGTGCCAGA ACGACAGAGC TTTATTGAGG AAAAGATTGC 9900 

AGAGGAGCCT GTACAGCTAG ATACGGCAGA AGGTCAAGTT CAACAAGTCA TAGATACGCC 9960 

AGAAGGCCAT TTTACCATTA CCTTTACCCC TAAGGAAAAG GAAGCTGTGC TGGACCGCCA 1002 0 

TAGTCAACAG GCTTTTGGTA ATGGCTATCT TTCAGTCGAG CAGGCCAATC TCATCCTCAA 10080 

TCATCTCCCT ATGGAGATTA CCTTTGTCAA TAAAGAAGAT ATTTTCCAGT ATTACAATGA 10140 
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CAATACGCCA GCTGATGAGA TGATTTTCAA ACGGACGCCG TCCCAAGTCG GGCGCAATGT 10200 

CGAACTCTGC CATCCGCCTA AGTACTTGGA CAAGGTCAAA ACTATCATGA AGGGGCTTCG 10260 

TGAGGGAAGC AAAGACAAGT AT G AAATGTG GTTCAAGTCT GAGTCGCGAG GTAAGTTTGT 103 2 0 

CCACATCACC TATGCTGCAG TACACGATGA AGACGGAGAA TTCCAAGGAG TGTTGGAGTA 10380 

TGTTCAGGAT ATCCAGCCCT ACCGTGAGAT TGATACGGAC TATTTTCGTG GATTAGAATA 10440 

AGGAGAAAAA ATGAGTTACG AACAAGAATT TATGAAGGAA TTTGAAGCTT GGGTCAATAC 10500 

CCAAATCATG ATTAACGACA TGGCGCACAA GGAAAGCCAA AAAGTTTACG AAGAAGACCA 10560 

GGACGAGCGT GCCAAAGATG CCATGATTCG CTACGAGAGT CGCTTGGATG CTTATCAGTT 10620 

CTTGCTTGGT AAGTTTGAAA ACTTCAAAGT AGGCAAGGGA TTCCATGATT TGCCAGAAGG 10680 

CTTGTTTGGT GAGCGAAATT ATTAAACGAG AAAGATTCTT GATTTTTCAC TAAAATCTTG 10740 

ATAGAATGTT TATGTTAAAT CCTTGTCAGA GCAGGGATTT TTTATTGAAA GGATTTTATC 10800 

ATGTCAAAGA AACTCAATCG TAAAAAACAA TTACGAAATG GCCTCCGTCG CGCAGGTGCC 10860 

TTTTCAAGTA CGGTGACTAA GGTTGTAGAT GAGACAAAAA AAGTCGTGAA GCGTGCAGAA 10920 

CAGTCAGCAA GCGCAGCTGG TAAGGCTGTT TCTAAAAAAG TTGAACAAGC AGTAGAAGCT 10980 

ACCAAAGAGC AAGCTCAAAA AGTAGCTAAT TCTGTAGAAG ATTTTGCAGC AAATTTGGGT 11040 

GGACTTCCAC TTGATCGTGC CAAGACTTTC TATGATGAAG GAATCAAGTC TGCTTCAGAT 11100 

TTCAAAAACT GGACTGAAAA AGAACTCCTT GCCTTGAAAG GAATCGGCCC AGCTACCATC 11160 

AAGAAATTGA AAGAAAATGG CATCAAGTTC AAGTAATTTT TCTTGAGCCT TGCATTTCCG 11220 

AAAAAATCTT GCTACAATAG AGCCATTAGA GGTGTTTTGA ATCCCACATT TTACAGAAAG 11280 

TGGCGGCGCT GAGAAGTCCA CAAATGTGTC AAAACTGGTT GCTAATGGAT GAAAAATTGA 11340 

AATAAAAGTG TCTTTTTGCT TTAAAGACGA GAGTTGCG 11378 
(2) INFORMATION FOR SEQ ID NO: 211: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4156 base pairs 

(B) TYPE : nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 211: 

CCGCGAGCCA CGGCGAATTT GCTGCGGGTA TTCATCAGTC AGGATCTATG ATCTTTGGTG 60 

AACAAGAAAA GGTTCAAGTT GTGACCTTTA TGCCAAATGA AGGTCCTGAT GATCTATACG 120 

CTAAGTTTAA TAACGCTGTT GCTGCATTTG ACGCAGAAGA TGAGGTTCTA GTTTTGGCTG 180 
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ACCTTTGGAG TGGTTCTCCA TTTAACCAAG CTAGTCGCGT GATGGGAGAA AATCCTGAGC 240 

GTAAGTTTGC CATCATCACA GGACTTAACT TACCGATGTT GATTCAAGCC TACACAGAGC 300 

GCCTCATGGA CGCTGCTGCA GGTGTAGAAA AAGTCGCTGC TAATATCATT AAAGAAGCCA 3 60 

AAGATGGCAT CAAAGCTCTT CCAGAAGAGC TAAATCCAGT CGAAGAAGTT GCAAGCGCTG 420 

CAGCTGCTCC AGTTGCCCAA ACTGCTATCC CAGAAGGAAC TGTTATCGGA GACGGTAAAT 480 

TGAAAATCAA TCTTGCCCGT CTTGACACAC GTCTACTTCA CGGTCAGGTT GCAACTGCTT 540 

GGACTCCAGA TTCAAAAGCA AATCGTATCA TCGTTGCTTC AGATAACGTG GCTAAAGACG 600 

ACCTTCGTAA AGAATTGATT AAACAAGCAG CTCCAGGTAA TGTCAAGGCT AACGTGGTTC 660 

CAATTCAAAA ACTGATTGAG ATTTCAAAAG ACCCACGTTT TGGAGAAACA CATGCCCTTA 720 

TCTTGTTTGA AACACCTCAA GATGCCCTTC GTGCCATCGA AGGCGGCGTG CCAATCAAGA 7 80 

CTCTTAATGT TGGTTCTATG GCTCACTCAA CAGGTAAAAC ATTGGTCAAT ACCGTTTTGT 840 

CTATGGACAA AGAAGACGTT GCTACATTTG AAAAAATGCG TGACTTGGGT GTTGAATTTG 9 00 

ATGTCCGTAA AGTACCAAAT GATTCTAAAA AAGATTTGTT TGACTTGATT AACAAAGCCA 9 60 

ATGTCAAATA AGCCATTATT TATGAAAGGA TTTTAAACAT GTCTATTATT TCTATGGTTT 1020 

TAGTAGTCGT TGTAGCCTTC TTTGCAGGTC TTGAAGGCAT CCTCGACCAG TTCCAATTTC 1080 

ACCAACCACT TGTAGCCTGT ACCCTTATTG GGCTTGTAAC AGGTCACTTG GAAGCAGGGA 1140 

TTATCCTCGG TGGATCGCTT CAAATGATTG CCCTTGGTTG GTCAAATATC GGTGCTGCTA 1200 

TCCCTCCTGA TGCTGCACTT GCTTCTGTCG CTGCTGCCAT TATCATGGTT CTTGGTGGTG 12 60 

ACTTTACCAA GACTGGTATC GGTGTTGCCC AAGCGGTTGC TATCCCTCTT GCTGTAGCTG 132 0 

GACTTTTCTT GACAATGATT GTTCGTACAA TTTCAGTTGG TTTGGTTCAT ACTGCAGATG 13 80 

CTGCCGCTAA AAAAGGTGAC TTCGGCGCTG TGGAGCGTGC GCATTTCATC GCGCTACTTT 1440 

TCCAAGGACT TCGTATCGCG CTTCCTGCAG CTCTTCTCCT TATGGTACCA ACTGAAACTG 1500 

TACAAAGTAT CCTTAGTGCC ATGCCAGACT GGCTCAAAGA TGGTATGGCT ATCGGTGGTG 15 60 

GTATGGTCGT TGCCGTTGGT TACGCCATGG TTATCAACAT GATGGCAACT CGTGAAGTAT 1620 

GGCCATTCTT CGCTCTTGGT TTCGTTCTCG CTGCTGTGTC AGATATTACT CTAATCGGAT 1680 

TCGGTGCTAT CGGCGTTGCT ATCGCTCTTA TCTACCTTCA CCTTTCTAAA ACTGGTGGAA 174 0 

ATGGTGGCGG AGGAGCCGCA ACTTCTAACG ACCCAATCGG CGATATCCTA GAAGACTACT 1800 

AAGATAAGAA AGGACTGAAA ACATCATGAC TGAAAAACTT CAATTAACTA AATCAGATCG 18 60 

TAAAAAAGTT TGGTGGCGTT CAACCTTCTT ACAAGGGTCT TGGAACTTTG AACGGATGCA 192 0 
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AAACTTGGGC TGGGCTTATA CACTCATTCC AGCTATCAAA AAACTCTATA CTAAAAAAGA 1980 

AGATCAAATC GCTGCTCTTG AGCGTCACCT TGAGTTCTTC AACACTCATC CATACGTAGC 2 04 0 

TGCTCCAGTC ATGGGGGTTA CTCTTGCGCT TGAAGAAGAA CGTGCTAACG GTGTGGAAAT 2100 

CGATGACGCT GCTATCCAAG GGGTTAAAAT CGGTATGATG GGACCTCTTG CTGGTATCGG 2160 

TGACCCAGTA TTCTGGTTTA CAGTACGCCC AATCCTTGGA TCTCTCGGTG CTTCACTTGC 222 0 

CCTTACTGGC AATATCTTGG GGCCACTCCT CTTCTTTGTT GCATGGAACT TGATTCGTAT 2280 

GTCATTCTTG TGGTATGTTC AAGAGATTGG ATACAAGGCT GGATCAGAAA TCACTAAAGA 2340 

TATGTCTGGT GGTATCCTTC AAGATATCAC TAAAGGAGCT TCTATCCTTG GGATGTTCAT 2400 

TCTTGCTGTC CTTGTTCAAC GCTGGGTAAA TATTAAATTT GCTTTCGATG TTTCTAAAGT 24 60 

TCAACTAGAT GAAAAGGCTT ATATCCATTG GGATAAATTG CCAGAAGGGT CTAAAGGTAT 2520 

CCAAGAAGCA TTCGCACAAG TAGGACAAGG ATTGTCTCAA ACTCCTGAAA AAGTTACTAC 2580 

TTTCCAACAA AACTTGGATA TGTTGATTCC TGGATTATOA GGACTACTCC TTACTTTACT 264 0 

TTGCATGTAC TTACTTAAGA AAAAAGTATC TCCAATCACT ATTATCCTTG CCCTCTTCGC 27 00 

AGTGGGTATT GTGGCACATG TTCTTCACAT CATGTAATCA AGCAACTAAA AAGGAACCAG 27 60 

GTTCTAAAAT CTGATTCCTT TTTTCTATGC TTTTATTCAG CCAAGGCTCC CATTGGATCC 282 0 

CATGGTGCAA GTACGATTGG TTCTGCTCCA TAGGCAGCTT GTTCTTCTGC TGTCAGCAAT 2880 

TCCTTACGAA CAACGATTTG GTATGTGTAT TCGTCCATCC AAGCGTCTGA GGCAACAAAG 2 94 0 

TAACCATCTG TACCGACCTT GTCTCCCCAT GAGTTTTCAA CCTTCCACTT GGTTCATTTA 3000 

CCATTTTCGT CCAAGTCAAC ACCTGTCAAG ACCATGGCGT GGGTCATCAA GCTTTCACTA 3060 

TAGTCCAAAC GTCCAGCCTT GTCTTGAGTA AGTTTAATGT CCATGCTTGA TTCAAAGTCA 312 0 

TAAACATCTG TCGCAAGGAT GCCAGCTTAC GGTTGCTGAG CTGGCCGACA TCAGAACCAA 3180 

ACCAAACAGT CTCACCTGCT TGCATTTGGG CAATCGCCAA TTCTTTCAAG CGCTCCATTG 324 0 

GAACGTTGAT GTAGCGAACT GCACGGCTAC CAACCACATT CCCCAACATC TCAACTGTGT 3300 

AAGATTTTCC GTAAGGTTTA TCAGCAGTTG GAGCATTGAT AACAGAAACG TAGTCTTCTA 3360 

AAGGAAGATT GACATATTTC TTGTAAAACT CTTGTGGTGT GATTCCTTTT TCACTTTTGT 3420 

AGTTGTTATC TTTATCGCGA TAAGCAAAGT CAAACTTGCG TGGTGGAAGT CCTAATGACA 3480 

TAGCAAGAAA GTTAAAGATT TCTTGCAAGA GGTCTTCTTT CTTAGCTTGA ACAGTCGCTT 3540 

GATCTGCACC AGAAACAAGC AAGTCACGCA AGATTTGAGC ATCTTGACGA AGCAATTTAT 3600 

TAAGGATCGC ATTTAGCTCA CGACTGCTGC TAGATGAAAC AGACTCAGGA TAAACTGACT 3660 

TAGGCACGAC ACCGTATTTT TCAAAGAGGG AAACGACCAT ATCCCATTGA CCGCCATCTT 3720 
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GTTGAGGTGT TTGGAGTAAG AAGCTAACtT GCGGCTAGTC AATTCTTGGT CTGAAGTCGC 37 80 

AATGACTTGC TCCAAGAACC AGTTTGATTT CTCATACTTA TCCCAGAAGA AAGTGTGGGC 3 840 

TTGTGACAAC TCAAAGTTCT CCAATTTGTA TTGCGAGATG AGTTTGTGGC GGAAGGTGTT 3 900 

GAGAGCCGCA AACATCCAGC AACGACCAGA CGCTTTCTGG TTAGTGACCT TGTCCTTGGT 3960 

TAAATCCAAT GAGAAAACAG GTGTGTTGTC TACATGGCTT TGGCGACGTT CCAGAGCTGC 4 020 

AAAAATTCCG TTGTGGCTGG CAGCATTTTC AATCGCTTGG TATTTTACAT TTGCTTCATA 408 0 

GTTGGCAAAT AGTTTATCAG TAAATGATTC TTGAATCGCG TTCATAGATT CCTCCTTTTA 4140 

GTCTACAGTG TATTGG 415 6 
(2) INFORMATION FOR SEQ ID NO: 212: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH : 3902 base pairs 

(B) TYPE : nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 212: 

AAAAACAACA AAATAAAACA AAAACAAAAA TATCGAGGTT TATTTTCAAA ACTTTCGATA 60 

TTTTTATTAA GTTATTATTT TGTTGTTTCT AGTTTACTTT TTGATGGTTA AGAGTGGTGG 120 

AGAATTATAC TCAATGAAAA TCAAAGAGCA AACTAGGAAG CTAGCCGCAG GCTGTACTTG 180 

AGTACGGCAA GGCGAAGCTG ACGTGGTTTG AATTTGATTT TCGAAGAGTA TTAGTGCAAA 240 

CCGTAGTTGT AGTCATCATC TTGCATGGCT TCAACTTCGC CAAGAAGGTA ACCATTTCCG 300 

ACTTGAGAGA AGAAGTCATG GTTGGAAGTT CCTGTTGAAA TACCGTTCAT AACGATTGGG 3 60 

TTGACATCTT CAGCTGAATC TGGGAAAAGT GGATCTTGTC CCATGTTCAT GAGAGCTTTA 42 0 

TTGGCATTGT AGCGAAGGAA GGTTTTAACC TCTTCAGTCC AACCAACACC GTCATAAAGA 4 80 

CTCTCTGTGT AGCCTTCTTC ATTTTCATAA AGAGTATAGA GTAGGTCGTA CATCCATTCT 54 0 

TTGAGTTTTT CTTGCTCTTC TTCAGGTAAT TCATTGAAAC CAAGTTGGAA TTTGTAACCA 600 

ATGTAGGTTC CGTGAACAGA CTCGTCACGA ATAATCAATT TAATGATTTC TGCAACGTTG 6 60 

GCAAGTTTGT TGTTACCGAG ATAGTAGAGG GGAGTGAAGA AACCAGAGTA GAAGAGGAAG 72 0 

GTTTCGAGGA AGACGCTGGC AACTTTCTTT TCAAGTGGGC TGCCGTTTAG GTAGATTTCG 780 

TTGACAATCT CAGCCTTCTT TTGTAGGTAA GGATTGGTAT TGGTCCATTC GAAAATTTCT 840 

TCAATCTCAG CCTTAGTATT CAAGGTAGAA AAGATTGATG AGTAAGATTT AGCGTGGACA 900 



WO 98/18931 



PCT/US97/19588 



1210 

GATTCCATAA ATTGGATGTT ATTGAAGACA GCTTCCTCAT GTGGTGTACG GATGTCTGCG 9 60 

CGAAGGGCTT GAACCCCAGT TTCAGATTGC ATAGTGTCAA GAAGGGTTAA ACCACCAAAA 102 0 

ACTTTTCCGA CCAAGTCTTT CTCTTTGTTA GATAGCTTTC TCCAGTCATC CAAGTCGTTT 1080 

GATAAGGGAA TACGTGTATC GAGCCAAAAT TGCTCCGTCA GTTTTTCCCA AGTTGATTTG 114 0 

TCGATGACAT CTTCGATGGC ATTCCAGTTA ATGGCTTTGT AGTAAGTTTC CATTTAAAAT 1200 

CTCTTTCTGT GTTTAGTATT GCGAACTCAC AATTATTTCT ACTTTACCAT AATTCTATAG 1260 

GAGTATCGCA CAAAAAGTCG GAAGCCCGAC TTTTAAAATG TTACATAAAT TATGTTATGA 132 0 

CATAGTAGAT TTGATTTTAT CAGTGCTGCT TAGGGAAAAA TAGTGTTTCT ATGCTAGAAA 1380 

CTAAATCACA CAGCTTTCAC ATTGGTTGGC GCCGACTTCT CCACCGTCAT CTGTAAAGGT 144 0 

ACGGACGTAG TAGATAGACT TGATTCCCTT GTTAAAGGCA TAGTTACGAA GGATGGACAA 1500 

GTCACGTGTC GTTTGTTTAT TTTCCCTCTT CCATTCGTAA AGGCCTTTTG GAATGTCACT 156 0 

GCGCATGAAG AGGGTGAGTG AAAGTCCTTG ATCCACGTGT TCAGTCGCAG CAGCGTAAAC 162 0 

ATCGATGACT TTACGCATAT CCATATCGTA GGCAGAAGTG TAGTAAGGAA TGGTTTCTGT 1680 

AGACAAGCCA GCAGCAGGGT AATAGATTTT ACCAATTTTC TTCTCTTGGC GTTCTTCGAT 174 0 

ACGTTGCGTA ATCGGGTGGA TAGAAGCAGA AACGTCGTTG ATATAGCTGA TAGAACCATT 180 0 

TGGCGCTACA GCAAGGCGAT TTTGGTGGTA AAGACCATCT TCTTGAACCT TGTCGCGAAG 1860 

TTCAGCCCAA TCAGCAACAC CAGGGATAAA GACATTTTTG AAGAGTTCTT TAACACGGTC 1920 

TGATGTTGGA ACAAATTCAC CAGTTACATA CTTGTCAAAG TAACTTCCGT TAGCATAGTC 1980 

TGATTTTTCA AAGTTGTGGA AGGTAATACC ACGTTCACGT GCAATATTGT TTGACTCTAC 2 040 

CAAGGTCCAG TAGTTCATAA GCATAAAGTA GATGCTTGTA AATTCAACAG ACTCAGGTGA 2100 

ACCATATTCA ATGAGTTGTT GGGCAAGGTA GCTGTGCAGT CCCATGGCAC CGAGACCAAA 2160 

GGTGTGGGCT TGGCTATTTC CATGGTCAAT CGTTGGTACA GCTACGATAT GTGAACTATC 2 22 0 

TGTAACGAAA GTAAGGGCAC GAACCATAGC ACGGATAGAA CGACCAAAAT CAGGTGAAGT 2 280 

CATCATGTTA ACCACGTTGG TTGAACCCAG GTTACATGAA ACATCTGTTC CCATTTGAAG 2 340 

GAATTCTTGA GCATCGTTGA TCAAGCTTGG TTCTTGAACT TGAAGAATCT CAGAACACAA 2400 

GTTACTCATG ATAATCTTTC CATCAACAGG ATTTGCACGG TTAGCCGTAT CGATGTTGAC 2 460 

TACATAAGGA TAGCCAGACT CTTGTTGCAA TTTAGAGATT TCAGTTTCCA AATCCCGCGC 2 52 0 

CTTGATTTTT GTCTTGCGAA TATTTGGATT TGCGACCAAT TCATCGTATT TTTCAGTAAT 2 580 

GTCGATGTAA TTGAATGGCA CACCGTATTC TTTTTCTACA GAGTAAGGGC TGAAGAGGTA 2 640 

CATTTCTTCA TTTTTACGAG CCAATTCGTA GAATTTATCA GGTACTACAA CACCAAGTGA 2700 
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TAGAGTCTTG ACACGTACTT TTTCATCAGC GTTTTCTTTC TTAGTTGAAA GGAAAGCGAT 27 60 

GATATCTGGG TGAAAGACGT TGAGGTAGAC AACACCAGCA CCTTGACGTT GCCCCAATTG 2820 

GTTGGAGTAA GAGAAGCTGT CTTCAAAAAG CTTCATAACA GGAACGACAC CTGAAGCAGC 2880 

TCCTTCATAG CCTTTGATAG GTGCACCAGC TTCACGAAGG TTGCTGAGGG TAATTCCCAC 294 0 

ACCACCACCA ATACGTGAAA GTTGAAGAGC TGAGTTGATA GAACGCCCGA TAGAGTTCAT 3000 

ATCATCCGTC ACTTGGATTA GGAAACAAGA TACCAACTCC CCACGACGAG CACGTCCAGC 30 60 

ATTCAAGAAG GAAGGAGTAG CAGGTTGGTA GCGTTGGTGG ATGATTTCAT TGGCAATATC 312 0 

GATTGCAACA GCTTCATTCC CATCAGCGAA ATAAAGGGCA TTGAAGAAGA CACGGTCTTC 3180 

CATATTTTCA AGATAGTATT CACCGTCATT AGTCTTTAAG GCATATTGAT TGTAAAATTT 3240 

ATAAGCTGCC ATGAATGACT TGAATTGGAA GTTTTGGTCT TTGATAAATT GAGCTAATTC 3300 

TTCCAAGAAC TCTGGACGGT ATTTCTTGAT AAAGGCTGTT TCGATGTAGT TGTGTTCAAT 33 60 

GAGGTAATTG ATTTTGTCTT TGATTGAATC AAAAACCATA GTGTTTGGAA CTACATTTTC 342 0 

TTTAAAGAAA GCATCCAAGG CTTCCTTGTC TTTATGAAGC ATGATTTGTC CATTAACAGG 348 0 

ACGGTTAATT TCGTTATTAA GACGGAAGTA AGTCACGTCT TCAAGATGTT TTAATCCCAT 354 0 

AAAATTTCCC TTATCTAATT ACAAAAGAAA GGCTTCTAAG TTAGCCCTAA AAGCAGTTTC 3 600 

TTCTGGATGA TGTACTAAGA TTATGCTAAT TGTTTCAGTT TTCCTGGTTG GAAACCTGAA 3660 

AAGACTTCAG TTGGTGTTTG GATAACAGGA GCTGCGCTAA AACCGAGCTC TTTAACTTGA 3720 

TCGACGTACT CAGGTTGCTC ATCAAGATTG ATTTCACGAT AAGAGACATT ATTACTGTCC 3780 

AAGAAACGCT TGGTCATTTT ACATTGGACA CAATTGTTTT TAGAATAAAC GGTTACCATT 3840 

GTGTAACTCC TCTTCAAAAT TTAATACTAT CTTAGTATAT CAGAAAATAA AATTTTGTCG 3 900 

GG 3 902 
(2) INFORMATION FOR SEQ ID NO : 213: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2456 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY : linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 213: 

TATTGAAGCT ATTGTAGACT ACAAAGATAA GGATTTGCAG TTAGTAGGCG GTGAGACTCA 60 

CTGATAACCT AAAAAGGATA GTCAATTATG CTTGTTTACT AACTATTAAC TATGCTAAAT 120 
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CAATTGAGGT TGTTTACATA AAACTCTATA TCAGAGAAGC CTGATATAGA 



\ GGATTTTTTT GTAAAATAGA AAAAGTGAAG AGAGGTATGA AATGAGCAAG 
AAAGATAAAA AAATCGAAAT TCAAGTAGCG GATGCCAAAG TTAATGTTGG TAAAGACAGT 
TTTGAAGGTT ATACATTGAC TATCGGTAAA AAAGTTATCG GAGAAATTGC CGAATTAGAC 
GGACAATTTG CCATTATAAA GAATGGGAAT GTCGATAGTT TTTATAAAAA ATTGGAAAAA 
GCTGTGGAAA TTTTGATTGA AAATTATAAT TTAGCAAAAT AAGTCTTGTT TTTTTGAAAT 
TTTCATGATA TAATAGTCCA TGTTGATTGT AGGAGAGATA GCGAAGAGGC TAAACGCGGC 
GGACTGTAAA TCCGCCCCTT CGGGTTCGGG GGTTCGAATC CCTCTCTCTC CATTTCATTA 
ATGGGGTATA GCCAAGCGGT AAGGCAAGGG ACTTTGACTC CCTCATGCGT TGGTTCGAAT 
CCAGCTACCC CAGTTCTTAG GTAATAATCA AGATAGAAAG CAAAATATCT TAGGGTATTT 
TATTTTTATA ATTGAAAGAC GTGAATGATA TGAACATGTC CTTGCGGGTG CTTAGGAAAA 
AAATTATAAG TATGTCAAGT TTAAGAAAAA CTTGATTGTT GGAGGATTTT TTAGATGAAC 
GAATTTGAAG ATTTGCTAAA TAGCGTTAGT CAAGTTGAGA CTGGTGATGT TGTTAGTGCT 
GAAGTATTGA CAGTTGATGC GACTCAAGCT AACGTTGCAA TCTCTGGAAC TGGTGTTGAA 
GGTGTCTTGA CTCTTCGCGA ATTGACAAAC GATCGTGATG CAGATATCAA TGACTTTGTT 
AAAGTAGGAG AAGTATTGGA TGTTCTTGTA CTTCGTCAAG TAGTTGGTAA AGATACTGAT 
ACAGTTACAT ACCTTGTATC TAAAAAACGC CTTGAAGCTC GCAAAGCATG GGACAAACTT 
GTTGGTCGCG AAGAAGAAGT TGTTACTGTT AAAGGAACGC GTGCCGTTAA AGGTGGACTT 
TCAGTAGAAT TTGAAGGTGT TCGTGGATTT ATCCCAGCTT CAATGTTGGA TACTCGTTTC 
GTACGTAACG CTGAGCGTTT TGTAGGTCAA GAATTTGATA CTAAAATCAA AGAAGTTAAC 
GCTAAAGAAA ACCGCTTCAT CCTTTCACGT CGTGAAGTTG TTGAAGCAGC TACTGCAGCA 
GCTCGCGCTG AAGTATTCGG TAAATTGGCT GTTGGTGATG TTGTAACTGG TAAAGTTGCT 
CGTATCACAA GCTTCGGCGC TTTCGTCGAC CTTGGTGGTG TTGACGGATT GGTTCACTTG 
ACTGAATTGT CACATGAACG TAATGTATCA CCAAAATCAG TTGTAACTGT TGGTGAAGAA 
ATTGAAGTGA AAATCCTTGA T CT TAACG AA GAAGAAGGAC GTGTATCACT TTCACTTAAA 
GCAACAGTAC CAGGACCATG GGATGGCGTT GAGCAAAAAT TGGCTAAAGG TGATGTAGTA 
GAAGGAACAG TTAAACGTTT GACTGACTTC GGTGCATTTG TTGAAGTATT GCCAGGTATC 
GATGGACTTG TTCACGTATC ACAAATTTCA CACAAACGGA TTGAAAATCC AAAAGAAGCT 
CTTAAAGTTG GTCAAGAAGT TCAAGTTAAA GTTCTTGAAG TTAACGCAGA TGCAGAACGC 
GTGTCACTTT CTATTAAAGC TCTTGAAGAA CGTCCAGCCC AAGAAGAAGG ACAAAAAGAA 
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GAAAAACGTG CTGCTCGTCC ACGTCGTCCA AGACGTCAAG AAAAGCGTGA TTTCGAACTT 19 80 

CCAGAAACAC AAACAGGATT TTCAATGGCT GATTTGTTTG GTGATATCGA ACTTTAATCA 2040 

AATTGAAAAT TCACAAAATC CTTTGTTTAC TAAACAAGGG ATTTTTCTGG CTCTTTGTCA 2100 

ACTGTAGTGG GTTGAAGAAA AGCTAAGCTC GAGAAAGGAC AAATTTTGTC CTTTCTTTTT 2160 

TGATATTCAG AGCGATAAAA ATCCGTTTTT TGAAGTTTTC AAAGTTCCGA AAACCAAAGG 222 0 

CATTGCGCTT GATAAGTTTG ATGAGATTAT TGGTCGCTTC CAGTTTGGCG TTAGAATAGT 22 R0 

GTAGTTGAAG GGTGTTGACA AGCTTTTCTT TATCTTTGAG GAAGGTTTTA AAGACAGTCT 2340 

GAAAAATAGG ATGAACCTGC TTAAGATTGT CCTCAATAAG TCCGAAAAAT TTCTCCGGTT 2400 

CCTTATTCTG AAAGTGAAAC AGCAAGAGTT GATAGAGCTG ATAGTGGTGT TTCAGG 245 6 

(2) INFORMATION FOR SEQ ID NO: 214: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 10974 base pairs 
(H) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY : linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 214: 

AAATAGGATA TAGAGACATC CTTCTGATCT GCTTTTwACA AAGTCCAATT ATATGCGGAT 

CTATACCTCC ACAATGTCCA TTATTATmCC TAACTATAAT ATGAGCCGAA AACACTATAT 

CCTTAATGTC TCCATATCCA TCAGGGATAT TAATATTTAT TTTTCCACAA CTATATTGCA 

TTGTAACCAT CTCCTTAAAC GACGCATTAT GATATTTGAT AGAGAAATTT TTATGAATAA 

CTCAATAATT TTATAGTAAA TCATGCTTAT ATCTCAAAGA TACCTATTTT ATCTTGTCTC 

GACCTTCTCC AAAGAATTGC TATAATACTA TTACAAATCC ATCTGCACTA CACTTCAAAT 

TTTAGCACTG TATAAAAACG TTTCAATACA CTAACTTCAA GAAAACTTCC ACTATTAATT 

GAAAAAATTG ATAGAGATAA ATTAAAAATC TATATTGAAA CTCATCCCGA TGCTTATTTG 

ACTGAAATAG CTGCTGAATT CAACTGTCCT CCAACAACTA TTCATTACGC TCTAAAGGCT 

ATGGGATATA GTCTAAAAAA GAGCCGTACC TACTGCGAAC AAGACCCAGA AAAAGTAAAT 

CGGTTCCTTA AAGAATTGAA TCACTTAAGC TACCTGACTC CTATTTATAT TTATGAGACA 

GGGGTTGAGA CCTATTTTTA TCTCGAATAT GATCGAGCCT TGAGCAGGCA GTTAGTCTCT 

CTGGAAGAAG ATATAATTAT TTGAATTAAG ATCGAGACAA CGCACACCAG AGATTGCGAT 

3 AAGTACTAAT GCCCTTTTTT GTTTCAATAT ACTATGGCTC CGATGACCTA 
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TAAAGATACG ATGACGAGTG ACTTTTTCGA AGCTTGCTTC CAAAAATTCT TACTACCTAC 900 

TTTAGATACA CCATCCCTTA TCATTATGGA CAATGCAAGG TTTCACAGAA TGAACATGTG 9 60 

TAAGGAGCAG GGCATAGACT GTTACCACTT CCTACCTATT CACCCGAGTA TAATCCCATT 102 0 

GAGAAAATAT GGGCTTACAT CAAAAACATC TCAGAATAAT ATTGTCAAAT TACGATGCTT 1080 

TTCTTGAGGC ACTTTTGTCC TATTCTTGTT TCAGCCGACT ATACTCCGTT ATTGGGCAGC 1140 

TACGGAACAG TCGATGGGAC GATGGGGGGA CATAAAAAAA TCCTCCAGTT TTGTTTTTTA 12 00 

TAACAGTATA CTGGAGAATT GACAATCTCG GTAGATACCT CGTTATAGCG CGGTTACTTA 12 60 

TTAGGCAGTT ACAAAACAAC TGTGAACAGA AAACATTCCA GAGTCAGACA AGACTTTGGA 1320 

ATGTTTTGGC TCTATAATTT CTGTAGTGGG TAATCCCACC CCAGGAATTA TAGGGTCGTT 13 80 

TCTTGTAGAA AAAAAGCCCC ATATGACCTA TAATGAAAAG CGTCTAACCA ACTCATTAGA 14 40 

AAGGGTTCAT ATGGAACAAC TTAAGAATAC CACAGATTTG CTCGGATTGG AAGACAAAAA 15 00 

TATCAAAATC TTGTCTGTTC TGAAATACCA AACCCATCTA GTCGTTCAGG CAAAGTTGGA 15 60 

TTCCCCCGCT CCTCCTTGTC CTCATTGTCA AGGGAAGATG ATCAAATACG ACTTCCAGAA 162 0 

AGCCTCTAAA ATTCCGCTTC TCGACTGTCA GGGTTTACCC ACGGTACTGC ATCTCAAAAA 1680 

GCGCCGCTTT CAGTGCAAGA ATTGCCTTAA GGTGGTCGTT TCTCAAACAT CCATTGTCAA 17 40 

GAAAAATTGC CAGATTTCCA ACATGGTGAG ACAAAAAATC GCTCAGCTCC TCCTTGAAAA 1800 

GCAGTCTATG ACTGAGATTG CCCACAGATT GGCGGTCTCA ACTTCCACCG TCATCCGAAA 18 60 

ACTGAGGGAA TTTAAGTTTG AAACCGATTG GACCAAGTTG CCAAAAGTTA TGAGTTGGGA 1920 

TGAGTATAGC TTCAAAAAGA GCAAAATGAG CTTCATTGCC CAAGATTTTG AGTCCAAATC 19 80 

CATCCTCGCA ATTTTAGACG GGCGAACTCA TGCGGTGATT CGAAACCATT TCCAACGCTA 204 0 

TCAGAGAGAG GTTCGGGAGC TGGTCGAGGT CATCACCATG GACATGTACA GCCCTTATTA 2100 

TCGGCTCGCT AAGCAACTCT TTCCAAAGGC GAAGATTGTT CTTGACCGCT TCCACATTGT 2160 

CCAACATCTG AGCCGAGCTA TGAACCGAGT ACGAATCCAA ATCATGAACC AATTTGACCG 22 2 0 

AAAATCCTTG GAGTATCGGG CGCTCAAGCG CTTTTGGAAC CCTCGCTTTT TCGTTTCTAG 22 80 

GCTCGGGCTA AATCAGTCCA CTGGACTGAT TTACTACACC AGTATAGCTT CAAGCTCTGT 2340 

CAGAAACGAT TCTATCAGCC CACGTTTCGA ATGCACTTAA CCCATCGGGA AGTACGAGAT 24 00 

AAGCTGCTTT CTTACTCTGA GGGATTACAG GTTCACTACG AACTCTATCA ACTCCTGCTC 24 60 

TTTCATTTTC AAGAGAAGAA TGCCGACCAT TTCTTTGGAT TGATTGAGCA AGAACTGCCA 252 0 

ACGGTTCATC CGCTTTTTCA AACGGTCTTT TGGACTTTTT TAAGGGATAG AGATAAGATT 2 58 0 

ATCAACGCAC TTAAGCTGCC TTATTCCAAC GCTAAACTTG AAGCGACCAA TAATTTGATT 2 64 0 
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AAGAT TAT C A AGCGCAAAGC CTTTGGTTTC CGGAACTTTA ACAATTTTAA AAAACGGATT 27 00 

TTGATGACTT TGAACATCAA AAAAGAGAGT ACGAATTTCG TACTCTCCAG ATTGCAGCTT 27 60 

TTCGCCTACC CACTACACTT GACAAAGAGC CACTCTTTAT TCCATGGTAT CAAAGGCAAG 2 82 0 

ACTTGGTTTG GCATTGAGGT CCCAGCCTGC GAAGTTTTCT TTGTTCCACT CGCTGACGCT 2880 

GGCATAGGCA ATCATACCTG CATTGTCTCC GCAGAGTCGC AGAGGGGGGA TGATAACCTT 2940 

GACATCTGTG ATTTCGGCTG CTAGGCGTTC TCTGAGACCT TTATTGGCTG CCACACCACC 3 000 

TGCCACAACT AGGATTTTAA CAGGATATTT CTCCAAAGCC TTCTTGGTTT TTGCCATGAG 3060 

AATGTCCATA ACTGCTGCTT GGAAGGAAGC ACACAAATCT TCTGTAGACA GGCTTTCTCC 312 0 

CTTTTGCTCG GCATTGTGAT GAAGATTGAT AAAGGCAGAT TTCAAACCTG AGAAGGAGAA 318 0 

CTCCAGATTA TCTTCCTTAA TCATGGCACG GGGGAAATCA TAAATATCCT GCCCCTGATG 324 0 

AGCCAGCTCG TCAATCTCAC GACCTGCAGG ATAGGTCAAG CCCATGACAC GGCCGACCTT 3300 

ATCATAAGCC TCACCAACCG CATCATCACG GGTTTCCCCA ACAATCTTAT AATCTCCTGC 33 60 

CTCCGAAACA TAAACCAACT CTGTGTGTCC GCCGCTGACC AAGAGGGCTA GCAAGGGAAA 3420 

CTCCAAAGGC TCCACACTCT GAGCTGCCAT GAGGTGCCCA GCCATGTGAT TAACAGGAAT 3480 

CAGTGGAAGT CCGTGAGCCC AAGCAAAGGC CTTGGCAGCT GACAAACCAA CTAGCAAGGC 3540 

TCCGACCAAG CCTGGTCCGT AGGTAACCGC AACAGCTGTC ACGTCCTCTT CGGTAATCCC 3 600 

TGCTTCTGCC AATGCCTCCT CGATACAGGC TGTAATGACC TCGACATGGT GACGACTGGC 3660 

TACTTCGGGC ACTACGCCAC CAAAACGTTT GTGACTCTCA ATTTGACTAG CAATGACATT 3720 

GGACAAGAGC TCATCGTCGT TTTTCAAGAC GGCGACACTG GTCTCATCAC AGGATGTCTC 3 780 

AAATGCTAAA ATATATCTAT CCTTCATCTA TTTCTCTCTT CATGATAATG GCGTCCTCGA 3 840 

CTGGGTCATG GTAGTAGGCC TTTCGCTCAG CGATAACTGT CATCTTTTCT TTCTTGTAAA 3 900 

ATGCTTGCGC TCGTTGATTT GACTGTCTGA CTTCGAGGAA AATTTCCTTG TCTGTCGGCA 3 9 60 

ATTGAGCAAA CAAGGcTGAC GCAATCCCCT GACCCTGATA AGCTCCTTTG ACAGCGATTT 4 020 

GCAGGACTTC TGCTTCAAAA AGATTCTCCT GCACAGCTAG AAATCCAATC ACTTCTGCCC 4080 

CATCATAAGC CAATGCATAC CAAGTCTGGT CTTGGGACAG ATCTGCTTGG ATTTGCTCCA 4140 

GAGTCCAAGG ACTGACTAGG TAAACAGCTG CCATAACAGC GTAGATGGCT TGAGCTAGGT 4200 

CAGGCTGTTG TTGAATTCGC TTGATTTCTA TCATAGGCGT TTAATGTAAG ACTCGCCAGA 4260 

CTCGGTATGG TTCTTGAGCC AGTTTTCCTC AGCCTCGACT CGTTTGAGGT AATTCGGCAC 432 0 

AAAATCATGC AAGGAGTCTG CTTCCTTGTC CCAGGCCAAA AGAGCTAGAT TAGCTGCATT 4 3 80 
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GGGCAATGTT TCTTTGTAAT CAGTCCTTGG CAAGTGTTTT TGAATCTGCT CAACAAAGGG 4440 

GCCAACTTCT CCGACAAAGG TTACCTGACT AGTACCCTTG ACTTTTTCTA GCACCTCTTC 4 500 

AAAAGATAGG TGCGCTTCTG CCATGACAGG TTTGGCATTT TC ATAAAAT C CTGCATAAAC 45 60 

ATTATTGCGA CGCGCATCCA TCAAGGGGAC AAACAAACCT TCTTGTTGAT GGGGCACCAG 4 62 0 

AGCCAAGAGA CTCGACATAC CAACCAACTC GATGTTCAGG GTGTGAGCTA AGGTCTTAGC 4 680 

AGTTGCTACC GCAATTCGCA AGCCTGTATA GCTACCCGGC CCTTCAGCTA CCACGATTCG 474 0 

GTCCAAATCC TTGGGTGTCC AATCCAAACT TGCCATCAAA AAATCGATGG CAGGCATAAG 48 00 

AGTAATACTG TGATTTTTCT TAATATTAAT CGTCGTCTCG GCAAGAACCT GCTTATCCTC 4860 

TAAAATAGCC AGAGAAAGAG CCTTGCTGGA CGTATCAAAA GCTAATACTT TCATAACACA 4 92 0 

TTCCTATCTT TTTGTCTGCT TACTATTATA CTACAAAAGC TGGCACATGG GAATTTTCTT 4 980 

TGCCCCCAGA CAAGAGTGCC CTCACTTAAC TAAAAATAAT TTAAAAAAAT GCTCACTTTT 504 0 

CCTTTTCTTT TCCGAATATA AAAGTGAACA AGAAAAAAGG AGGAAAGTTC AATGACAAAT 5100 

TTTGACATTC TTGACAATCA ATTTTTATCC TTATCTGAAA ATGAATTATC AGATATTGAT 516 0 

GGCGGTCTCG CTCCCTTGGT TATCTTTGGA GTAGCAGTAT CTTGGAAGGC TATTGCAGGT 52 2 0 

GGAACAGCAC TTATAGGTTC TGGTTTGGCA GCTGGTTATT TTTTAGGAGG AGATTAATAT 52 80 

GATGAAAGAT TTGAACAATT ATCGTGAAAT TTCTAATAAG GAATTGCAAG AAATCAAGGG 5340 

TGGCTTTGGT GTCGGTGTTG GTATCGCTTT ATTTATGGCA GGTTATACCA TTGGAAAAGA 54 00 

CCTTCGTAAA AAGTTTGGTA AGTCATGCTA GATAAGAAAC ACATTTTTAG AAGGATAAAT 5460 

TTTATTGTCT TCATCTCTTA CAGTTTGCTC AGCATTCTCA ATGATTTGAA CATTACTACC 552 0 

ATCCCTTTAC CATTCGATTT ATCTGTTTGT ATTGTTTTAT TTTTATGCTT CAACTCTATT 558 0 

TTTGATCAGA ACAATGACTC CCATAAAAAT AATAAGCTTT GAAAATTCCA TTGTCATGTC 5640 

ATGTTAGAAA AATGCAAAGA CCACCTCATC TTGATAGATG GGGTGGAATT TTCGTGTCGT 57 0 0 

AAATCTACTA TCTCTACATT CCCAAACAAA AAACCCCAGC ATAAGCAGGG CATCTAAGCA 57 60 

TTTAATTCAA AGTAAAATAC AAACCAAACG ACATAGGTCA CGAGGAGGAG AAAAAGCGAG 582 0 

TAGAGAGTCA CAAAGGTCAT TTTCCACAAG AACTTGGTTT GTCGTCGTTC CAGTTTGGCA 5880 

AATAGAAGAT TCCCCGCATA AACGCAAGCA ACAAAAACAA TAAAAGCTAC CAAGCGAGCT 594 0 

CCGATAGCAA AAGCAAATAA GTTATACATA GGGCAACCTC CTTGACTTAA AATCTATATG 6000 

GAATTATGAC AAGCAATAAA TTTCACTTCC GTTATCAACA TAATACATTT TCTTTATTTT 6060 

TGAAAACGCT TACCAAAGAA ATCGTCCCCT AACTTTCTCG TTTCCGTCTT TTACTAATTT 612 0 

TTCATTTTGT GGTATAATTG AAATAATTGT AACGAATCAA GGTCAATCTA GACACAAAAT 6180 
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GGAATGAAAT CAAGCAAATA TCTGCTAAAA GTTTGGAATA AGCTGACCTG TAAATAGAAA 624 0 

GGAACTATAT GATTTACAAA GTTTTTTATC AAGAAACAAA AGAACGTAGC CCACGCCGTG 63 00 

AAACAACACG CACGCTTTAC CTAGACATCG ATGCCAGCTC AGAACTTGAG GGCCGTATCA 63 60 

CTGCTCGCCA ACTTGTCGAA GAAAATCGCC CAGAGTACAA TATCGAGTAT ATCGAACTCT 64 2 0 

TGTCTGACAA ATTGCTCGAT TACGAAAAAG AAACTGGCGC CTTCGAAATT ACGGAGTTCT 64 80 

AATATGGCCT ACACTCTTAA ACCTGAAGAA GTCGGCGTTT TTGCCATCGG TGGTCTAGGA 654 0 

GAAATCGGGA AAAACACTTA CGGAATTGAA TACCAAGACG AGATTATCAT CGTCGATGCT 660 0 

GGGATTAAAT TCCCAGAAGA TGACTTGCTT GGTATCGACT ATGTCATTCC TGACTACTCT 6660 

TACATCGTGG ACAATATCGA CCGCGTCAAG GCTGTTTTAA TCACACACGG ACACGAGGAC 672 0 

CACATTGGTG GGATTCCGTT CCTACTCAAG CAAGCAAATG TCCCTATTTA TGCTGGACCG 678 0 

CTTGCCTTGG CTTTGATCCG TGGGAAACTC GAAGAACACG GCCTCTTGCG CAACGCCAAA 684 0 

CTTTACGAAA TCAACCACAA CACCGAGTTG ACCTTTAAAA ATCTCAAGGC AACTTTCTTT 6900 

AGAACGACTC ACTCTATTCC AGAGCCTTTG GGGATTGTCA TTCATACTCC TCAAGGGAAA 6960 

ATCGTCTGTA CGGGTGACTT TAAGTTCGAC TTTACTCCAG TTGGAGAACC TGCGGACTTG 702 0 

CATCGTATGG CTGCGCTTGG TGAAGAAGGC GTGCTCTGTC TCCTGTCTGA CTCGACAAAT 7080 

GCGGAAGTAC CAACCTTTAC CAACTCTGAA AAAGTCGTTG GTCAGTCCAT TATGAAGATT 7140 

ATCCAAGGTA TTGAAGGACG TATCATCTTT GCATCCTTTG CCTCAAATAT CTTCCGTCTC 7200 

CAGCAGGCAA CAGAAGCTGC TGTTAAGACT GGACGCAAGA TTGCGGTCTT TGGTCGTTCT 7260 

ATGGAAAAGG CCATTGTCAA CGGAATCGAT CTTGGCTACA TCAAAGCTCC TAAGGGAACC 732 0 

TTTATCGAGC CAAATGAAAT C AAAG AT TAT CCTGCAGGAG AAGTTCTTAT CCTCTGTACA 7380 

GGTAGTCAGG GTGAGCCTAT GGCAGCCCTC TCTCGTATCG CCAACGGAAC CCACCGTCAA 7440 

GTACAATTAC AACCAGGTGA TACCGTTATC TTCTCTTCTA GTCCCATCCC TGGAAACACT 7500 

ACTAGTGTCA ACAAGCTGAT TAACATCATT TCTGAAGCTG GTGTCGAAGT TATCCACGGT 7560 

AAAGTGAACA ATATCCATAC ATCTGGACAC GGTGGTCAGC AAGAGCAAAA ACTCATGCTC 7 62 0 

TGCTTGATTA AGCCAAAATA CTTCATGCCT GTCCACGGTG AATACCGCAT GCAAAAAGTC 7 68 0 

CACGCTGGAC TAGCAGTGGA TACTGGTGTT GAGAAGGACA ATATCTTTAT CATGAGCAAT 7740 

GGCGATGTGC TTGCCCTTAC TGCTGACTCA GCTCGTATCG CAGGTCATTT CAACGCCCAA 7 800 

GATATCTATG TCGATGGAAA TCGTATCGGT GAAATTGGCG CAGCTGTCCT CAAAGATCGT 7860 

CGCGATCTAT CTGAAGACGG TGTCGTTCTG GCAGTTGCAA CTGTTGACTT CAAATCGCAG 7 920 
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ATGATTCTAT CTGGTCCAGA CATCCTCAGC CGAGGCTTTG TCTACATGAG AGAGTCTGGC 7980 

GACTTGATTC GCCAAAGCCA GCGTATCCTC TTCAATGCCA TTCGTATCGC ACTGAAAAAT 8040 

AAGGATGCTA GCGTGCAATC TGTCAATGGT GCCATTGTCA ACGCTATTCG CCCCTTCCTC 8100 

TATGAAAATA CCGAACGTGA ACCGATCATC ATCCCGATGA TCCTCACACC AGATGAAGAA 8160 

TAAAGCAAGA AAACAGCCCC GTCCTCGGAG CTGTTTTTCT CTATGCTTTC TTTTGAGATT 822 0 

AAAACTCATA CTCAATGAAA ATCAAAGAGC AAACTAGGAA GCTAGCCGTA GGTTGCTCAA 8280 

AGCACTGCTT TGAGGTTGTA GATAGAACTG ACGAAGTCAG TAGCCATACC TACGGCAAGG 834 0 

CGACGTTGAC GCGGTTTGAA GAGATTTTCG AAGAGTATCA ATAAAAATCG AAATCAGACT 840 0 

AGAAGGCTAA GCGAAAGCAT AACTTGAGTT AGCTCCCATA GTTCGGGAAA CTATGGGAGG 8 4 60 

CTGGAGATGA ATCAAAGCCA AGCTTTGAAC TCATTCGTAA GAAGCCGACG ACGTATCATT 8 520 

TTGATTTTTG AAGAGTTTTA GAAATACTAC GATTTTTACC TTCCAGATAC ACCATCAAAA 8 580 

TAGAAATATC TGCTGGGTTT ACTCCCGAAA TACGGCTGGC TTGGCCGATG GTTTCTGGAT 8640 

TGATGAGTTT GAACTTCTGA CGGGCTTCGG TTGCGATAGA ATCAATGTCA TCCCAGTCGA 8700 

TATTGGCCGG AATGCGTTTT TCTTCCATGC GTTTCATCTT GGCAACCTGG TCCATGGCTT 8760 

TGGAAATATA GCCTTCATAC TTGATTTCTG TTTCAATCAA TTCGATAATC TTGTCATCCA 8 820 

AGTCTTCTGC AGCTGGTCCG ATGAAGGCCA CCACATCTTG GTAAGAAACT TCTGGACGGC 8 880 

GAAGGAATTC CTTGGCTGTC ACTGCATCGG TCAAGGGTTT GAAGCCCATC TCCTCAACCT 8940 

TGGCATTGGT TTCCTTGACT GGCTTGAGTT TGATACTGTC TAGGCGCTTC ATCTCATTAT 9000 

CAAATTGATT TTTCTTGATT TCAAAACGAG CCCAGCGTTC ATCGTCCACA AGGCCAATCT 9060 

CGCGTCCCAT CTCAGTCAAG CGCATATCAG CATTGTCATG ACGAAGAATG AGACGGTATT 9120 

CAGCACGACT GGTCAAGAGA CGGTAGGGTT CAATGGTTCC CTTGGTCACC AAGTCGTCGA 9180 

TCATCACCCC GATATAACCA TCACTGCGCT TCAAAATCAA TTCAGGCTTG CCTTGGATTT 9240 

TCAGAGCCGC ATTGATACCC GCGATAATCC CTTGGCCTGC TGCCTCTTCG TAACCTGATG 9 300 

TTCCATTTGT CTGACCAGCA GTGAAGAGAC CTGAGATTTT CTTGGTTTCC AAAGTCGCAC 9 3 60 

GCAACTGATG AGGCAAGACC ATATCATACT CAATAGCATA ACCTGTCCGC ATCATCTCTG 9420 

CATTTTCCAA ACCTTTGATG GAATGCACCA AGTCACGCTG GACATCCTCA GGCAGACTGG 9480 

TTGAAAGTCC TTGCACATAG ACTTCCTCAG TATTGCGCCC TTCTGGCTCA AGGAAGAGTT 9540 

GGTGACGTTC CTTGTCCGCA AAGCGCACAA TCTTGTCTTC AATCGACGGA CAGTAACGAG 9 600 

GCCCCACTCC CTTGACCACA CCTGTAAACA TAGGCGCACG GTGGAGGTTG TTTTGGATAA 9 660 

TCTCATGACT GGTACCATTG GTATAGGTCA ACCAGCATGG TACTTGGTCC TTGACATAAT 9720 
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CCTCATCACG TGAAGTGTAT GAGAAATGAT TAGGCACTTC GTCTCCTGGC TGAATTTCTG 97 80 

TCACATCGTA ATTGATAGAA GAAGCCTTGA CACGTGGAGG GGTTCCTGTC TTGAAACGAC 984 0 

CGATTTCGAG ACCCAGTTCC TTGAGATTGT CAGCTAGGTT AATAGAAGCC AAGCTGTGGT 9900 

TAGGACCTGA TGAGTACTTG AGGTCTCCGA TGATAATTTC CCCACGGAGA GCAGTCCCTG 99 60 

TCGTCACAAT AACAGCCTTA GCAGCATATT CTTGATGGGT GGCTGTACGC ACACCGACAA 1002 0 

CCTTGCCATC TTCCACCAAA ATCTCATCAA TCATGGTTTG ACGAAGGGTC AGATTTTCTT 10080 

GGTTTTCAAC CGTCTTGCGC ATCTCCTTAG AGTAAAGTTC CTTGTCAGCC TGCGCACGAA 10140 

GGGCACGGAC AGCTGGCCCC TTCCCTGTGT TTAGCATCTT CATCTGGATG TAAGTCTTGT 10200 

CAATGGTTTT GGCCATCTCG CCACCGAGGG CATCGACTTC ACGCACGACA ATCCCCTTGG 102 6 0 

CAGAACCACC GATAGAGGGA TTACAAGGCA TGAAAGCCAG CATTTCAATA TTGATGGTCG 1032 0 

CAAGCAGGAC CTTACAGCCC ATACGGCTAG CGGCCAAGGA AGCCTCAACC CCAGCGTGTC 10380 

CCGCACCAAT TACAATAATA TCGTATTCTT CAGTAAAATG ATAAGTCATG TTTCTCTCCT 10440 

ATTCCTCAAG ATGAATGTGT CTTAGTTGGC CTTCCCAATC TGGTAGGGCT GTTTTTAAAA 10500 

AGACTGGAAC TAGCTGGATA TTCTGGAGCT TATCCAAGTC AATCCACTCA GAt-GGCTGCC 10 5 6 0 

TTTTCTCATC TTCCTGCATG GTCAACGGGG CATCTTCAAG CAAATCCACC AGATAATGAA 10 620 

ACTCGATATT GTGATAGGAA ACGCCGTCCA CTTCAAAACG ATTTTCAACC ACAAAAGCTA 10580 

GCTGCCCAGC TTGAGCTTTG ACACCCAGTT CTTCCTTCAC TTCACGGACT ACCGCGTCTT 10740 

CCGTGCTTTC ATTGACTTGA ATCGCACCTC CAATAGTGTA ATACTTGCCC TTGTCTTTGG 10800 

TAACTAGAAG CTTGTGATTT TGGACAATCA AGGCTGTAGC CCGAACACCA AAAACCGTAT 10860 

TGTCTACTTT TGTCCGAAAG TCTTGTTGAG TCATTCTTGT CCTTTCCCTT AAACGACACA 10920 

AAAACAGTCA AAACTACAAA GAAGTGCAGG ACAAAAAAGC CTGCAACATC CAGG 10974 
(2) INFORMATION FOR SEQ ID NO: 215: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 987 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 215: 

CCCGTTATGA TTATGGATAG CGCTTTCAAA TTTTTAAACT CCTATCCCAT CCTTTTATCT 60 

ATATAATAAG TGAAAATATA ATAACTGTCA AGTAACTGAA GTGAATTTTA TAAAAAAATT 120 
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ACAAGCCAAA TTTGTAAAGT TTACACTAAG CCGCTAGgCA ATCGTCTATC AGAATATCCG 
TTTATTTGTC AATAATCCGA GAAAATCTTG CAACGCTTAG AAGTCTATAA AAACTATCAA 
CATTTATATG ACTTGCGAAT AGCAATCCTG CTAAACCTTT CCACACTCTA TCTATACAAT 
CAAGATAAAA ACATGTGTAA GCAAATCTGC TACACTTTAC TGGAGGACGC CAAGAATAAG 
AAAAGCTACG ATAGGCTTGC TATCTGCTAT GTCCGTATTG GGATTTGTAC AGACGATTCT 
AAACTTATCC AAAAAGGGTT CTCCCTTCTG GAGCTGACCG AGGAAACTTC TATGCTGTCT 
CATCTCAAAA AAGAAGTAGA GACCCATTAT CAACCAAAGA AATTATAAAA AAAGTCGAGG 
GAGCTCCTCG ACCTTTTCAT AGAATCGCCG AACGATTTAA CGAGAAAGTA TGACTTTTAC 
GTTTATCCCA ACTCAATTAT GACATTTTTT TCAAAAGTCA ATATATCTCA CTTTTTCAAC 
GACAAGAAAG AGGCTGATAA TCTACCAACC TCTTATTCTG AACCCATCAC TCCATCACTT 
TTTAGCTTCA TTCGCTTTCT TAGCGACTGC AATCTGGTAT TCGACTTGGT CATTCCCCTT 
ACCGGTACAA CCATGAGCAA TTGTAGTCGC TCCTATCTGA TGCGCTATTT CAACCAATTT 
TTTAGAAATC AGAGGGCGGC TCAAGGCAGA TACCAAGAGA TACTTTTGTT CATAATAGGC 
ATGTGACTGA TGAGCCACTA GCACATAATC TGTAGCAAAT TCGTCCTTAA CATCAATGAC 
ATAAGATTCT ACTGCCCAAA CCTTAAG 
(2) INFORMATION FOR SEQ ID NO: 216: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2651 base pairs 

(B) TYPE : nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 216: 

CTGGGTCTTG TTCATAGTAG GTGTGGTtCT TTTTTTCGAG TGTAGCCCAT AGCTTTGAGC 6 0 

GCATAGTGGA TGGTAGTTGG ATGACAGCCA AAGTCAGAAG CTATTTCAGT CAAATAAGCA 12 0 

TCTGGATTGT CAGTAAGATA GTTTTTAAGT CTATCTCTAT CAACTTTTCT TGGTTTTGTT 180 

CCTTTTACTT GGTGGTTTAG CTCTCCTGTT TTCTCTTTTA GCTTTAACCA GCCATAAATG 2 40 

GTATTACGTG AGATTTGGAA AACGTGTGAT GCTTCTGTTA TACTACCTAT TCGCTCACAA 300 

TAAGAGAGAA CTTTTTTACG AAAATCTATT GAATATGCCA TAAGAAGATT ATACCACATT 3 60 

GTGTACTATT TTTGGTTCAT TTTACTATAT TTTATAAGTT ATAGTGTAGC ATTCCAACTT 420 

CAAAGCACTA TAAAGTAAAT TGAAACAAGA ACAATACAAA CAATTCTCGT AAACGGATTG 4 80 

CAACCACAAA AAAGCAAGCA TTCACAAGAA TACTTACCTA TCATGGGAGG AACAACCGTT 54 0 



WO 98/18931 



PCTYUS97/19588 



1221 

CCTCTTTTTT ATTACTAAAA TTCAAAGAAT TCCAATGCTT TTTTCAAGAG CAAATCCGTA 600 

TATTCTGGAT CTTCTTGGGC TACTTCTATT TCCCGCTGAA CTTTTTCCAA ATCATCTGTA 6 60 

ATCACTCCAT CTACTCCTAA GTGAAGAGAT TTGCTGATAG CTTCTGAATC ATTGACAGTC 720 

CAGACATAAA GTTTCTGATC CGTTGTCCAT AGTTTGCTTA CAAAATATTC ATCCAAGGTT 7 80 

GAGTACTCCA TAGTATATCC TGTCGCTCTT GTTTTAGGAA AGACAGAATT GTAGGGCATG 840 

AT G AAAT AAA CTGGTAGTTC GGCATCATAC TGTCTTACTT TTTCGACAAC ATGGTAGTCT 900 

AAAGACTGGA TTTGATGTCC ATAAATCTTG AGCTTTGCAG CATAACGGGC TAAAAAGCGG 960 

TTCATCATGT CTGGACTATC TTTTTTACTG GTTTTAATTT CAATTAGTAA TTTTTGACCA 1020 

AGTTCGTTGG CTCGACTGAG ATAATCTTCA AAGCTTGAAA TTTTAGTCTG GTAGCCATTT 1080 

TCAAAAATAT CAATCCCTTT AAGCTCCTCC AAGTTTAAGT CTTGAGGACT TTTATTGATA 114 0 

CCTGCTAGAT TTTTCAAGTT AGCATCATGC ATCATGACAA ACTGCCCATC TTTTGTTTCC 12 00 

TGCACGTCCG TCTCCACCAA GTCTGGTTTG AGTTGTGCTG TAGTTTCCAA GGACTCTACT 12 6 0 

GTATTTTGAA TCCCATTTGC ATTGGAAACC CCTCGGTGAG AAATAAGTTG AGGTAGATGA 132 0 

ACCATGGGAG CCTCCAGATA AATATAACCT TCTAAGGCAA AGAAAAGACT GGCACAAGTC 13 80 

ATGACACCCC ATCGCACGAT GTGATCTTTT TCTCTCCTAG GAAGCATATC CAGCTCCTTT 14 4 0 

CCTGTCAAAA ATGAAACAAA TTTAACCAAA AAATAAGTCA GAGCCATATA ATAGAGATTT 1500 

TTAATCACGA CAAAATTCAA AATACCAAGA ATCAGAGACT CTCTCTGAGT GATATCATCT 1560 

ACCAAAGTTT GAGCCAATAA TAAAGGAATC AAAGGAAGAT ArAATAATAA ATGTGCTTTG 162 0 

AGCAAGATGT AAAATAAATT CCAAGCATAA AAAGTAACTC TCTTCTTGGT TTTCTCCAAG 1680 

CTAAACATCA CTGCTTCTCG AACAGTCAGC TGATCATATA CAATCTTCGG AAGGGCAAAC 1740 

ATCAATCTGA CAGAGACATA GAGAAAGATA AGAGATAGAA GTAGGATGCT CAGCCACCAC 1800 

ATCCAATATC TATCTTCTAA ATAAGCTTGG ATAAACTCTG GAATGACGAT TTTATTAAGA 1860 

TAATAAATCT TCAGCATTTT CCGTATAAAA GGAAACAGCA TAGCTATATA GAAAAAGATA 192 0 

AACAAGGCTT TAGCGCAAGT TAGCTTTTTC ATAAATCCAA AACTTTCATG GAAAACCTTG 19 80 

CGGATATACT CAATTAGCCT TCGCTTTTCA TTATAGAGGA GATGACGAGC ACCAATAAAG 2040 

AGGAGTCCTA TTTGAAAATA AGCAACCAGA AGGTTAATTA CAATCAAGGC TAAAAAAGCT 2100 

AGACTAATCA ATGGAGAATG AGTAAGGATG GCTAAGACAT TGTTATAGGA AATAAAAAGA 2160 

TAACCTGTCT GATCTAATAA GAAGCTAGCC AACCATGAAT TGAATGGTAC CCACAAATAC 222 0 

TCCACTATCA TAAAAATCAA GAAAAATAGA AAGAGGATTT TATCAAGATC GAGGTAAATC 22 80 
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TGTTTAAGAC CCAATTTTTT AGGTTTTTCA GGTTTCATAG GCACTCCTAG TCAAATAATT 2340 

GAGACAAGTC CAAGCCACCA AAAGGATTGT TTGATAAGCT ACTTTCTGTC TCTAACAATT 2 400 

CCCTAGCTTG ATCCGACTCT AAGAAGGATT CGTAAACACG CGCCGTCATC CGAGCATCCT 2460 

CTAAACTATT ATGAGACTGA CCTTGAAATC CAAGAAATGA GGCAACAGTT TGCAATTTGA 2 52 0 

GATTGGCAAT ACCATGTAAA TCTGAACTCC GACGTTCAAA AGCTTCATCA TACAAATCCA 2580 

CCTTGTACTG TTGGCTATAG TCTAAACCAT GCTCTGCTAA AATAGGTAAA TCACTTTTAG 2 640 

CAGCATTGTA G 2651 
(2) INFORMATION FOR SEQ ID NO: 217: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 5638 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 217: 

CGTTATAATA AACTTGTGAA AAAATTAACA AAGGATATCG TTCCTTGAAA GCTATGGAGG 6 0 

AAAATATGGC TGATAAAAAA ACTGTGACAC CAGAGGAAAA GAAACTCGTT GCTGAAAAAC 12 0 

ACGTAGATGA GTTGGTTCAA AAAGCTCTAG TTGCCCTTGA AGAAATGCGT AAATTGGATC 180 

AAGAACAAGT TGACTACATC GTTGCCAAAG CATCAGTAGC AGCTTTGGAT GCCCACGGAG 24 0 

AATTGGCTTT ACATGCCTTT GAAGAAACAG GACGTGGTGT ATTTGAAGAC AAAGCAACTA 3 00 

AGAACTTGTT TGCCTGTGAA CACGTAGTAA ACAACATGCG CCACACTAAG ACAGTTGGCG 3 60 

TTATCGAAGA AGACGATGTA ACAGGATTGA CTCTTATTGC TGAACCAGTT GGTGTTGTTT 42 0 

GTGGTATTAC TCCAACAACA AACCCAACAT CAACAGCAAT CTTCAAATCA TTGATTTCAT 4 80 

TGAAGACACG TAACCCAATC GTCTTTGCCT TCCATCCATC AGCACAAGAA TCATCTGCTC 54 0 

ATGCAGCTCG TATCGTCCGC GATGCAGCTA TCGCAGCTGG TGCTCCTGAA AACTGTGTGC 600 

AATGGATTAC TCAACCATCT ATGGAAGCAA CAAGTGCCCT TATGAACCAC GAAGGTGTTG 6 60 

CGACAATCCT TGCAACAGGT GGTAATGCCA TGGTTAAGGC GGCTTATTCA TGTGGTAAAC 720 

CAGCTCTTGG GGTAGGTGCC GGAAACGTTC CAGCTTATGT TGAAAAATCA GCAAACATTC 780 

GTCAAGCAGC ACACGATATC GTCATGTCTA AATCATTTGA TAACGGTATG GTCTGTGCAT 84 0 

CTGAACAAGC AGTTATCATT GATAAAGAAA TTTACGATGA ATTTGTAGCA GAGTTCAAAT 900 

CTTACCACAC TTACTTTGTA AACAAAAAAG AAAAAGCTCT TCTTGAAGAG TTCTGCTTCG 9 60 

GCGTCAAAGC AAACAGCAAA AACTGTGCTG GTGCAAAATT GAACGCTGAC ATCGTTGGTA 102 0 
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AACCAGCAAC TTGGATTGCA GAACAAGCAG GATTTACAGT TCCAGAAGGA ACAAACATTC 10 80 

TTGCTGCAGA ATGTAAAGAA GTTGGCGAAA ATGAGCCATT GACTCGTGAA AAATTGTCAC 1140 

CAGTTATTGC AGTTTTGAAA TCTGAAAGCC GTGAAGATGG TATTACTAAG GCTCGTCAAA 12 00 

TGGTTGAATT TAACGGTCTT GGACACTCAG CAGCTATCCA CACAGCTGAC GAAGAATTGA 12 60 

CTAAAGAATT TGGTAAAGCT GTTAAAGCTA TTCGTGTTAT CTGTAACTCA CCTTCTACTT 132 0 

TTGGTGGTAT CGGGGACGTT TACAATGCCT TCTTGCCATC ATTGACACTT GGATGTGGTT 13 8 0 

CTTACGGACG CAACTCAGTT GGGGATAACG TTAGTGCCAT TAACCTCTTG AATATCAAAA 1440 

AAGTCGGAAG ACGGAGAAAT AACATGCAAT GGATGAAACT TCCTTCAAAA ACATACTTTG 1500 

AACGTGATTC AATTCAATAC CTTCAAAAAT GTCGTGACGT TGAACGTGTC ATGATCGTTA 15 SO 

CTGACCATGC CATGGTAGAG CTTGGTTTCC TTGATCGTAT CATCGAACAA CTGGACCTTC 1620 

GTCGCAATAA GGTTGTTTAC CAAATCTTTG CGGATGTAGA ACCGGATCCA GAT AT CAC AA 1680 

CTGTAAACCG TGGTACTGAG ATTATGCGTG CCTTCAAACC AGATACCATC ATCGCACTCG 174 0 

GTGGTGGGTC TCCAATGGAT GCTGCCAAAG TAATGTGGCT CTTCTACGAG CAACCAGAAG 18 00 

TGGACTTCCG TGACCTTGTC CAAAAATTCA TGGATATCCG TAAACGTGCC TTCAAGTTCC 1860 

CATTGCTTGG TAAGAAGACT AAATTCATCG CGATTCCAAC TACATCTGGT ACAGGATCTG 1920 

AAGTAACACC ATTTGCCGTT ATCTCTGATA AAGCAAACAA CCGTAAATAC CCAATCGCTG 198 0 

ACTACTCATT GACACCAACT GTGGCAATCG TAGATCCTGC TTTGGTATTG ACAGTTCCAG 2040 

GATTTGTTGC TGCTGATACT GGTATGGACG TATTGACTCA CGCGACAGAA GCATACGTAT 2100 

CACAAATGGC TAGTGACTAC ACTGATGGTT TAGCACTTCA AGCCATTAAA TTGGTCTTTG 2160 

AAAATCTCGA AAGCTCAGTT AAGAATGCAG ACTTCCACTC ACGTGAGAAA ATGCATAACG 2220 

CTTCAACAAT CGCTGGTATG GCCTTTGCCA ATGCCTTCCT AGGTATTTCT CACTCAATGG 2280 

CCCATAAGAT TGGTGCGCAA TTCCACACAA TCCACGGTCG TACAAATGCT ATCTTGCTTC 2340 

CATACGTTAT CCGTTACAAC GGTACACGTC CAGCTAAGAC AGCAACATGG CCTAAGTACA 2400 

ACTACTACCG TGCAGATGAA AAATACCAAG ATATCGCACG CATGCTTGGA CTTCCAGCTT 2460 

CTACTCCAGA AGAAGGGGTT GAATCTTACG CAAAAGCTGT CTACGAACTC GGTGAACGTA 2 52 0 

TTGGGATCCA AATGAATTTT AGAGACCAAG GAATTGACGA AAAAGAATGG AAAGAACATT 2580 

CTCGTAAATT AGCCTTCCTG GCTTATGAAG ACCAATGTTC ACCAGCTAAC CCACGTCTTC 2 640 

CAATGGTAGA CCATATGCAA GAAATCATCG AAGATGCATA CTATGGCTAC AAAGAAAGAC 2700 

CAGGACGCCG TAAATAATTG TTTATCAGTC TAGAAGCAAG ACAAAAACTC AATTTGAGGG 2760 



WO 98/18931 



PCT/US97/19588 



1224 

AAAGATCCAG TAATTTTTCT ATGATAAAAG GCATCCTATC AAGGTTTTTG AACACCTGAT 2 82 0 

AGGATGCCTT TTTATGATAT TGAGGCCTTT TTGCCCTTTT TGAAAAACTA GAATAGAAAC 2 880 

AAAATATATA ATAGATTGAA ACTAGAATAG TACATATCTG CTTCTAAAAC ATTGTTAGAA 2940 

TTCGATTTGA CTGTCCTGAT CGATTTGTCC TGTTCTTATT TCATTTTGAT ATATAAAAAA 300 0 

TATAGTATAG TAGACTGAAT CTAAAATAGT ACGAAACAAT TGCTAAAACA TTTATAGAAA 3060 

TTAATTTTAC TTTTCTGATA GAGTTGTTCA CATCTTATTT CAATTCACTA TAGTTTAATT 312 0 

TAAGAGTAGT ATTTACTAAG GCCCAATTAA AATCAAAGAG CAAACTAGAA AACGAGTGCC 318 0 

ATTCAGCTCA AAACACTGAT TTGAGATTGC AGATAAGACT AGCCCCCTCA TTAACAGATT 3240 

TACGATAAAA CGATGACAAG GTGTGTTGCT TTTTGATTTC TAAAGAGTAT AATGATAGAT 33 00 

CTCTATAAAA TAAGTGCGAA GGAAATGAGC TTTTATAGTC CTTTCGTTTT AAAATACTAT 33 60 

CTCAGATATT CTTATATCGA CAAGAAGTTT TTGAGTCATT CCCTCATCAT ACATATTAAA 342 0 

TAAATAGTGG CTCATTCAAT TTTTCACTAG AATAATAAGC TAGTATAGTA AACTGAAATA 34 80 

AGATATAAAC AAATAAATTG GAGCTTAACA TCCATTTCCA GCAATTTTTT AGAAACTACA 354 0 

GTGGACTATT CTAGATTCAA CAT ATT AT AA AAACTAGAGT AAAAGAAAAG GATTGGATCT 3 600 

TGTGTAATGC AGGATCCAAT CCTTTCAATC ATTTTGTCCA ACTTTTGGAG GTTCCTACAA 3660 

TGTAGTCGTC ATTAATAAAG ACAGATGGGA ATGACAGTGT TCCTATTTAT TTTGATAGAG 372 0 

ATCGATGAAT TCTTTAGATA GCAACTGAAT AATCTCTGTT GAAGCCATTT GGTCTTCTGC 3780 

ATGCATAAAT AGCAAGGAGA ATCCTATTTT TTCTCCAGTA GCTTCTTTTT GTATGAGATT 3840 

AGAGTGAATC TTGTGCGCTT CTACTAAGGA GTCTTCCGCT TCTTCAACTT TAATTTTCGC 390 0 

TTCTTTTAAA TTTCCTGCCT TAGCTAGTTG GATGGCTTCA ATAAAGGATG ATTTGGCTGC 3960 

TCCACTATTG GCAATGAGCT GAAAACAGAT ATATTCCATT TCTTCTGTCA TCTTATTTCT 402 0 

CCTATCCATG CAAGTGCTTG TTCCAGAACT TTTGCTCCAT TCATCATTCC GTAATCCCGC 4080 

ATATCAATGG TATCTACAGG GATATTTCCT GCAATTTCTT TCACAGCAAG TAACTCATAA 414 0 

CGAATTTGTG GCCCAATTAG AATGACATCT GCTTCATGGA TATTCTTTTT AGCTTCTGTC 420 0 

ATTGATTTTG CTTGGATAGA GATTTCAATC CCACGTTCAG TCGCACTTTG TTGCATTTTT 42 60 

TTAACAAGCA TACTTGTCGA CATTCCCGCA TTACATACTA ATAAAATTTG TTTCATAATC 432 0 

TTAACCTTCC ATTTCTTGTT CAACAACTTT GTCATTAACT TTGATAAATG GAATGTATAG 4380 

AAGAACTCCA AGTGCAAAGA TGATGAATTG AACTAGAACT GCTCTCACGT CCCCTGCTGT 4440 

TGCTAACCAT GCATTTAAGA AT AC TGGTGT AGTCCAAGGA ACTTGTATAA ATGCAGGACT 4500 

CATGAATTCT GTAACTGTTG CTAAGTAGCT GATTAAAATA CCAAGGACTG GAACTGTGAT 4560 
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AAATGGAATA GCTAATGAAA TGTTATAAAC GATTGGGTAA CCGAATAATA CTGGTTCATT 462 0 

GATATTGAAG ATACCAGGTC CAAAAGATAA TTTAGCCACG TTTTTAGAGA CAGCATTGCG 4 680 

ACTCACTAAG AATGTTGCTA TTAATAAACA TAATGTAGAT CCACTACCAC CCATTAAAGC 4740 

GAATGTTTGT ATTTGTGATA GGTTGATGAT GTGTGGAATG GCTTGTCCAT TATTTGCTGC 4800 

AGTGATGTTT TCAGTAATGT TAATTAATAG TAATGGTTCT AGGATGGCAC TGTAAATAAC 4860 

TGCTTGGTGA ATACCAAATA GCCATAACAT ATTTCCTAAA GAGTAAATAA TAATGACCCC 492 0 

GATTAAGCTT GTACCAATAT GACGAATTGG TTCTTGAATA AAGATTGTAA TGATTGAGAT 4980 

TAAGTTCATT CCAGTTATAT TGAATAATAA TGCTGAAACA ACCCCAAATA AGGAGATGAC 504 0 

GGTCATGACT GGAAGTAATA CGCTAAATGA TCTACTAACA GCTGGTGGAA TATTTTCACC 5100 

AAGGTTCATT TGTAAAGCTT TAACGTTTGA TAATTCAATG AATAATTCTG TTGCAATAAT 5160 

CG t ACGATAA CCCCGGCGAA CATTGCGCCT GTACCTGTGT TGTTGAATGA AAGAACACCT 522 0 

GAAATGTTTA CCGCATCTTT TGCTCCGTCA GGAACTACAG AAACTGTATT TGGCATCATC 5280 

ACAATTAAAG AAACTAATGA TAGCATTGAT GCTGCTAACG GGTTTTCGAA ATCTCTGTTT 5340 

TTAGCTAAGA AATAACCAAC CATTACAGCA ATAATCATAC CTGAAATACT TAAAGTACCG 5400 

TTTGCAATTG TTATTCCCCA ATATTGGAAT CTTGTTAATG TATCCCCTTG GAAAATCCAC 5460 

TTAAATACCG TGTTGTTCAA AAGAACGATT AAACCTGCCA AAATATATAA TGGCATTACT 5520 

GTTACGAATG CATCTCTTAG GGTTTTTAAA TGAATTTGGT TCCCTAGTTT ACCAGCAAAG 5580 

GATGGCAAAA AAATTTTTTT GGGGGGGGGG GTTATTAAAC CCCCCTTTTT AAAAAAAA 5638 
(2) INFORMATION FOR SEQ ID NO: 218: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 4745 base pairs 

(B) TYPE: nucleic acid 
<C) STRANDEDNESS: double 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 218: 

CCGGAAGCTG TTGCCCTTGG AACTCCAAAT GAAGAAACAG CCTTTGTCTT GAACTATTTT 60 

GGTGTGGAAG CACCACGTGT TATCACTTCT GCCAAAGCAG AGGGGGCAGA GCAAGTTATC 120 

TTGACTGACC ACAATGAATT CCAACAATCT GTATCAGATA TCGCTGAAGT AGAAGTTTAC 180 

GGTGTTGTAG ACCACCACCG TGTGGCTAAC TTTGAAACTG CAAGCCCACT TTACATGCGT 240 

TTGGAGCCAG TTGGATCAGC GTCTTCAATC GTTTACCGTA TGTTCAAAGA ACATGGTGTA 3 00 
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GCTGTGCCTA AAGAGATTGC AGGTTTGATG CTTTCAGGTT TGATTTCAGA TACCCTTCTT 360 

TTGAAATCAC CAACAACACA CCCAACAGAT AAAATCATTG CTCCTGAATT GGCTGAATTG 42 0 

GCTGGTGTGA ACTTGGAAGA ATATGGTTTG GCAATGTTGA AAGCTGGTAC CAACTTGGCT 48 0 

AGCAAATCTG CTGAAGAATT GATTGATATC GATGCTAAGA CTTTTGAACT CAACGGAAAT 540 

AATGTCCGTG TTGCCCAAGT GAACACAGTT GACATCGCTG AAGTTTTGGA ACGCCAAGCA 600 

GAAATTGAAG CTGCAATGCA AGCTGCCAAC GAATCAAACG GCTACTCTGA CTTTGTCTTG 660 

ATGATTACAG ATATCGTCAA CTCAAACTCA GAAATCTTGG CTCTTGGTGC CAATATGGAC 720 

AAGGTCGAAG CGGCTTTCAA CTTCAAACTT GAAAACAATC ATGCCTTCCT TGCTGGTGCC 780 

GTTTCACGTA AGAAACAAGT GGTACCTCAA TTGACTGAAA GCTTTAATGC GTAAGATTTT 840 

GGGTGTCAGC TCAAAATCGG AAAGTCTAGT TTGCCTTATA TCGCAAGGAG TTTCGGCTCC 900 

TTTTTTCTAG GAGTGAAGTA TGTTAGAAAA TGGCGATTTG ATTTTTGTGA GAGATGGGTC 960 

AGACATGGGA CAGGCCATCC AGACTTCCAC AGGTAACTAT AGCCATGTTG CCATTTATTT 1020 

GGATGGGATG ATTTATCATG CTAGTGGACA GGCTGGTGTT GTCTGTCAAG AACCGGCAGA 1080 

CTTCTTTGAG TCCAATCATT TATACGACCT CTATGTTTAC CCAGAAATGG ATATCCAGTC 1140 

GGTGAAGGAA AGAGCTTGCA AACATCTTGG AGCACCCTAC AATGCTTCTT TCTATCCAGA 12 00 

TGCAGCTGGT TTTTACTGCT CCCAGTATAT AGCAGAAATC CTACCTATTT TTGAAACTAT 1260 

TCCTATGAAA TTTGGAGwTG GGGAGCAGGA GATTAGTGAT TTTTGGAGGG AGTATTACAT 1320 

AGAACTAGGT CTGCCTGTTC CTCTGAACCA AGCTGGTACC AATCCTAGTC AGTTGGCAGC 1380 

ATCGCCTCTG TTACAATGTA AAGAAAGGAA TCTTCATGAT TCAGATTTTT AATCCATCTC 1440 

GTTTGACGAG ACAGCCATTT TTGGAGAATT GATCCGCTAT CTGGATCAGT ATGAGGATGT 1500 

GATTCTACGG GAAATTAAGG CTCAATTTCC AGATGTTGCA GTTGATAAAC TCATGGAAGA 1560 

GTATATAAAG GCAGGCTTGA TTCTACGTGA AAATAAGCGC TATTACCTCA ATTTTCCTAC 1620 

GCTTGAATCA CTTGATAGTC TTGAACTGGA TCAAGAGATT TTTGTCAGAG AAGCTAGTCC 1680 

GGTCTATCAA GCCTTGTTGG AGCAGAGTTT TGAGACGGAA TTGCGCAATC AAATCAATGC 1740 

AGCTATTTTA GTTGAAAAGA CGGACTTTGC GCGCATTAAA ATGACCCTGT CCAATTATTT 1800 

TTACAAGGTC AAACAGCAGT ATCCTTTGAC AGAAAAACAG CAGGAGCTCT ATGACATTTT 1860 

AGGAGATGTT AATCCTGAGT ATGCCCTCAA GTATATGACG GCTTTTTTGT TGAAATTTCT 1920 

CAAAAAAGAC CAGCTTATGC AGAAATGCCG TGATATCTTT GTGGACAGTT AGGTTGTCTT 1980 

AGGCTATATT GTGCAAAATG AAGATGGAAA GTATGAGTTG GCTATCGATT TTGATAAGGA 2 040 

GAGGTTAACT TTCTACTTAG CGTGATTTCT TGTTTCTGAG TACATTGTTT GACTTTCCTT 2100 
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AGTATTCGGT ATAAACTATA TGTAACCGGT AACACATATC GGAATAAACT AAAGGAGACA 2160 

ATCATATGTC ACTTGAAAAC AAATTGGAAC AAGCAACAGG CGCTGTCAAA GAAGGTTTTG 2 22 0 

GTAAAGTTAC TGGAGACAGC AAGACAGAAC TTGAAGGAGC TGTTGAAAAA ACAGTTGCTA 2 280 

AGGCAAAAGA CGTTGTAGAA GACGCAAAAG GTGCTGTAGA AGGTGCCGTT GAAGGTTTGA 2 340 

AAAACGTTTT TACTAAAGAA TAGGAAAAAA TCAAGGGTTT CATTTTCCCT TGATTTTTTC 2400 

TATTCTTATA AATAATTTTC TGCGACGGCT GTATCTCCTG GGTAGGATTC TTTCTTGCCC 2 460 

TGGATGATTT GGTAACAATC GGCTCCCTTA CCCGCAATAA TAACTGCATC TAATTCGTGA 2 520 

TTTGTGATAG CCATTGCCGC CTTGATGGCT TCTTGGCGAT CCGCAATCTT TTCAACAGGA 2 580 

TGATTGATGT AGCTACTAAT TTCATCTGCA ATGGCCATTG GGTCTTCATA GTTAGGGTCA 2 640 

TCAGCAGTCA GAAAGACTTG AATCTCAGGG TGTTGATTGA GGAGGAGGCC AAAGTCCTTA 2 700 

CGACGACTTT CTCCCTTGTT TCCTGTTGAT CCCAGAACCA GAGCAATCTT TCCGGTTTGA 27 60 

TGAGTTTCAA CCACATTGAT GAGTTTTTTC AGACTATCCC CATTGTGGGC ATAGTCGATG 2 820 

AAGACCTTGG CTCCATTTTT CTGAGTGAGG ACTTCCATAC GACCAGG A At : UCUMWTCCA 2 880 

GCGATGCCTT TTTTGATGTC CTCAAGACTT GCTCCGAGAC GGAGACAAGC AAGTCCAGCA 2 940 

GCAACTGCAT TTTCTTGGTT GAAGTTGCCA ATGAGTTGAA TATCATAATC TCCAGCGAGT 3000 

TTACCCGTAG CTGAAAAGCT AAAGGCTTTG GAATTCTCGA TTTGGTTATC AAATTGGCTA 3 060 

CCATAGAAAT CATGGTCTTG ATCTTCAACC TGTTCTTTCA AGACTGAGAA GTGGTCCATG 3120 

TCACTGTTAA TGATGACTGC TCGGCTCTTT TCCATCAAGA GACGCTTGTG GTAGAAATAG 3180 

TCTTCAAAGC TAGGGTGTTC AATCGGGCCG ATATGGTCTG GGCTGATATT TAGGAAAACT 3240 

CCCACATCAA AGGTTAGACC ATAGACACGT TTGACCAGAT AGGCTTGACT GGAGACTTCC 3 300 

ATGATGAGGT GGGTACGGTC ATTTTGCACA GCCTGATTCA TCATGTCAAA GAGGTCAATA 3 3 60 

CTCTCAGGGG TTGTCAACGC TGACTTAAAG AAAGTCTCGC CATCAAGAGT TGTGTTCATG 3420 

GTCGACAACA TAGCAGGTCT ATGCCCTTGA GATAAGATGT TATAGGCGAA ATAGGCTGCT 3 4 BO 

GTTGTCTTAC CCTTAGTACC AGTAAAGGCA AGGAGTTTGA GTTTTTCCTG TGGATTACCA 3 540 

TAGAACTCCA TGGCAATCAA ACTCATGGCT TTCTTTATAT CGTTCACAAT GATGACAGGG 3 600 

ATACCGACTT CGTAGTCCTT TTCAGCTACA TACCAAGCTA ATCCTTGTGT TATAGCAGAA 3 660 

AGAAGGTATT CTTTTTTAAA GGCAGCGCCT TTTGCGAAAA AAAGAGTGTC TTCTGTTACT 3720 

TTTCGGCTGT CGTAGCTGAT GCTATCAAAA ATAACTTTGC TGTAGTTGTA GTGGTAATGA 3 7 80 

CCTTGGTCAA TAATTTCGCG AAAAAGGCCA TCTTTCTTTA AAATATCTAA TACGGTTTCA 3840 
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ATCTTAATCA TACTTTCTAT TGTAAACCGA AAGTCGTAAA TTTACAAGTA ACAAGGAAAA 3 9 00 

GTTTATAATG GAAGATAAGG AGTTTTTCCT AGTTATCAAA ATTGAATGAG GAATCTATGT 3960 

CGCACGAAAA CAATCACCAG CAGGCCCAGA TGTTACGGGG GACTGCTTGG CTAACGGCTA 4 020 

GTAACTTTAT CAGTCGCCTA CTCGGGGCTG TTTACATTAT CCCTTGGTAC ATCTGGATGG 4 080 

GGGCTTATGC AGCTAAGGCA AATGGTCTCT TTACCATGGG TTACAATATC TATGCTTGGT 4140 

TCTTGTTGGT TTCAACAGCG GGGATTCCAG TTGCGGTGGC CAAGCAAGTT GCCAAGTATA 4 2 00 

ATACCATGCG AGAAGAAGAG CATAGCTTTG CCCTGATTCG GAGCTTCTTA GGCTTTATGA 42 60 

CAGGACTAGG CCTGGTTTTT GCTTTAGTCT TGTATGTCTT TGCTCCTTGG CTAGCAGACT 4 320 

TGTCTGGCGT GGGCAAAGAC TTGATCCCAA TCATGCAAAG CTTGGCTTGG GGAGTCTTGA 4 3 80 

TTTTCCCGTC TATGAGTGTT ATCCGAGGAT TTTTCCAAGG GATGAATAAC CTCAAACCCT 4440 

ATGCCATGAG CCAAATTGCT GAGCAGGTCA TTCGTGTTAT CTGGATGCTC CTAGCAACCT 4 500 

TTATCATTAT GAAGCTCGGT TCAGGAGATT ATCTAGCAGC CGTTACCCAA TCAACCTTTG 4 5 60 

CTGCCTTTGT CGGTATGGTA GCCAGTTTTG CAGTCTTGAT TTATTTCCTT GCCCAAGAAG 4 620 

GTTCACTCAA AAGAATCTTT GAAACAGGAG ATAAGATTAA CAGTAAGCGT CTCTTGGTTG 4 6 80 

ATACCATTAA GGAAGCCATT CCTTTTATCC TGACAGGGTC TGCCATCCAG CTCTTCCAGA 4740 

TTTTG 4745 
(2) INFORMATION FOR SEQ ID NO: 219: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1900 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY : linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 219: 

CCTGATTGAC CTTATAATAA GGAACAAAAC ACAATGCACT ACCTTTTCAA CAAAAGAGTT 60 

GCTGCTTGAT TAAAACCATC ACACCAGTTA TACCATTTTG CTTCATACCC ATCTTGAGCT 12 0 

AGGATACGAT CTTCTAAATC AAAAACAGAG TAAATCTTTC TTTCCTCGCA AGCTTGCGCA 180 

TAGAGATGAT ATAGTTCATC ACCACCATCT CTATCCCACT CAGCAGAAAT CGTATCCCGA 24 0 

CCTGCCAATA AAGCCTGATA AGCCCTGTGA TGCCCATCTG TAATCAGCAA ACAATCTCCA 3 00 

AAGGCAAGAA TACTGATTGG ATCGACTTGG ATTGTTTCTG CCGACTGGTA AAGCATCTGA 3 60 

ATATCTTGCA ACTTCTTTTC TGATAAATAT AGTTGAGTCA GATGAAGATC TGCTATATTG 42 0 

ACTTTCATTT CTTTCTCCTC AAGGGAATTC GATACTCACT TCTGTTTGCC TTTAAATCGC 480 
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CATTGGAAGC GGAgCTTGTC ATAAAAGGGA AACTCGATAA ACAGGACTCC CAAGCCCACA 540 

CAGAGACTGG CAAGGACGTC TGATGGGTAA TGAACTCCCA GATAGACTCT TGATACCAGC 600 

ACACTGACTA GGTAGAGGCC AAGGACGATT TGTACGATTT TTCTCCAGAC CTGATCTTTA 660 

ATCCGCTGAC TAAGAATAAC AATCAAAGTC CCTACCATCA GCGTTACAGC TAGAGAATGC 720 

CCACTTGGGA AGGAAAATCC CTTCTCCTCC ACCAGATGTA AAATAGCTGG TCGTGGGCGC 7 80 

TGGTAGATAT TTTTAAAGGT CACGATTAAA AGACCTGCCA AAGCCAGATT TCCCAGCATG 840 

AAGAAACTTT CTATCTTCCA TCGCTTACGA TAAAAGACAA AAGCTGTAAT GACAACCCAA 900 

GTGATAATCA CTGGGATATC AATCAGACGT GTGAGGGCTC GAAAAAGAAT AGTCAAATAA 960 

TCTGGTAAGT CTCCTCGAAT GGCAGTCTGA ATCGATTGGT CAAAATTGAC CAACATTTCA 1020 

GGGTAAAATT TGACCATGTA GCCAAGAATA ACGAAAAGTA AAAGGGCAAA ACTGCCCTTC 1080 

ATTAAAAATG TTTGTTTATC TC'TCATAATG TTTTAAGGTT GGTTTCAAGA GAACATACAA 1140 

CAACCAGAAT GAAACGGAAA AGAT AACAC C TTCAATCAAG TTAAAAGGTA ATACCATGGT 1200 

CATTAGGTAG TTGGAAAGTC CCAAAATTTT TCCAATATCA AAGTTAGCAA ACTTAGCGTA 12 60 

CAAAGGAACA GCATAAACAT AGTTGAGAAC CAACATGGCC AAGGTTAAAC CAATAGTTCC 13 20 

AGCTAGAGAG CCTAGTAGGA AACGAAGGGT TGTCCGTTCC TTTTTCCAAA TCAAAGCAAA 1380 

TACGATGACA AAAACTCCCA AAGCTACGAT ATTCATCGGC AAACCAATGT AAGTATTCAC 1440 

TCCTTGGCTG TTAAGAAGCA ATTTCAAGAG TGAGCGAAGC AAGAGCACTC CTAGAGmCsC 1500 

AGGCAAATCC ATGACCACCA GACCCACAAG GACTGGCAAG ATACTAAATT CGATCTTGAG 1560 

GAAAGATGCC GCTGGTAAAA GCGGAAAGTC AAAGTACATC AGCACAAATG AGATGGCTGA 1620 

TAGAATTGCA ATGGTCGAAA GTCGACGTGT GTTTGTCATA ACAGGTTCCT CCAATTTTCT 16 80 

ATAAAATCAG AAGAAGTTGG AAAGGATTCC TCTATCTATT CTCACTTTTT ATATCCCAAA 1740 

AGTTCCCTCT TACTCTATTA AAGAAAAACA AAGCAAGTGG TTACAATCCG GCTATAAATC 1800 

TATCAAAACA GACAAGGCTA TTCTTTCGTC TTCTCCCATC CAGACTATAC TGTCGGTTGT 18 50 

GGAATCTCAC CACATCACGT TGCGCTCACG GACTTCTTTA 1900 
(2) INFORMATION FOR SEQ ID NO: 220: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4 692 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 220: 

GGTTTTCCAG CAGGAGCTTC TCCTTTATCA GAATGACCAT CCCATCTGCT CACGATAGAT 60 

GAATAATGAT ATTTTTTACC ATGATAGTAA TTTGAAAAAG CCTAACCACC TCCTGAACCT 120 

TCTCCATATG TCCATACTCC TCCATCTGGA TATTATACAG CAGCTGATGC AGCTCCCAAT 180 

AATGTAAAAC TTGAAATAAG AGCTAGAGCA AGTAATCTAT GTTTTTTCGT TTTCATTTTA 240 

TTTTTTCTTT CAAAAAAAGC ACACCTTGAG CAACAATGCA ACAAAATAAA TCCTCCTCTC 300 

TCTTTTATTG AAACCGCTTT CTTATGTGAT AAGAATAACT TTTTTAT t AT TTGTTGTCAA 360 

GGAAAAAATC GAATTTTTTA GATATTTTAC TATATTACCT CTGTGAATAA TATTATATAG 42 0 

TAGTTTTATT TCAAAATAAT ATGCAACCAG TACTAACCAA ATATAAAATA GATGCCATTA 480 

ACGAATTTTA TTCAAGTTTT TCCCATTCAT ACTATACAAG TAAAAGAGAT GGTGTTAACT 540 

AAAAAGCAAT TCAAACTATT GTAAAATTCC TAGCAAAAAG AGAGCCGAAA CTCTCTTTTT 600 

TATCTTCTTT TACTTTTTTT GACTGGCATG AGTGTGATGT CTCTAACACT AAAGTAAGCT 660 

AGGATCAACA TGGCTATTGC TAGGAATATT TCTGTTGGTA ATTGAAAAAT TTTCAGAAAA 7 20 

GATAGAACCA ATAAAATCAA GAGTGCCACT AAAATACATA CCATAGCGAC GATATTGACA 7 80 

GTCCCTTTAA TGCTTTCTGG TGTCGCAAAT ACATAGAGTA GGAGCAGTAA AATTCCTAGG 840 

ACTAAATAGA CCATCTTTCT CTCTTTCTAG CTCTTATTCA GCTGATTTTT TCTTCTTGTT 900 

AGCTTTCTCA CGCTCTGCTT T GTTAAGG AT TTGTTTACGC AAACGGATAG ACTCAGGCGT 960 

TACTTCCATG TACTCATCGT CGTTCAAGAA CTCAAGAGAC TCTTCAAGTG TCAAGATACG 1020 

AGGCGTCTTG ATAACAGCTG TTTGGTCCTT AGTAGCTGAA CGAACGTTGG TCATTTGTTT 1080 

TGCCTTCGTG ATGTTAACTG TCAAGTCATT TTCACGAGAG TTTTCACCGA TGATCATTCC 1140 

TTCATAAACC TCAGTACCTG GGTTGACAAA GATCGTACCA CGTTCTTCGA TAGACATGAT 12 00 

TGAGTAAGTT GTAGCCTTAC CAGCATCGAT AGAAACAAGG GCACCACGGT GACGTCCACC 12 60 

AATTTCCCCT GGAATCAATG GCAAGTATTG GTCGAAGGTA TGGTTCATGA TACCGTAACC 1320 

ACGAGTCATT GATAAGAACT CAGTTGAGTA TCCAATCAAA CCACGCGCTG GAACAAGGAA 13 80 

GACCAAACGA GTTTGACCAT TACCAGTTGA AATCATATCC AACATTTCAC CTTTACGTTC 1440 

AGAAAGGCTT TGGATAACAG ACCCTTGGTA TTCTTCTGGA GTGTCGATTT GTACACGTTC 1500 

AAATGGTTCA CATTTAATAC CGTCGATTTC TTTTACGATA ACTTCTGGAC GAGATACTTG 1560 

AAGTTCATAG CCCTCACGAC GCATTGTTTC GATAAGGATT GACAAGTGCA ATTCTCCACG 1620 

TCCTGAAACA GTCCATTTAT CTGGTGAATC AGTTGGGTCA ACACGAAGGG AAACGTCTGT 1680 

TTGCAATTCT GCCTGCAAGC GTTCTTCCAC CTTACGAGAA GTTACCCATT TACCTTCTTT 1740 
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ACCAGCAAAT GGTGAGTTGT TGACCAAGAA AGTCATTTGA AGAGTTGGCT CATCGATGTG 1800 

TAGGATTGGA AGAGCTTCTA CTGCATCTGT CGGAGTGATG GTTTCACCGA CAAAGATGTC 18 60 

TTCCATACCT GAAACGGCAA TCAAGTCACC CGCTTTGGCT TCTTGGATTT CACGACGTTC 1920 

CAAACCAAAG AAACCGAAGA GTTTTGTAAC ACGGAAGTTT TTAGTTGTAC CGTCAAGTTT 19 80 

AGAAAGGGTA ACTTGGTCCC CAACCTTAAC TGTACCACGG AAGACACGAC CGATACCGAT 204 0 

ACGTCCAACG AAGTCATTGT AGTCCAAAAG TGACACTTGG AACTGCAAAG GCTCATCTGA 2100 

GTTATCTACT GGAGCTGGGA TATGGTCGAT AATCGTGTCA AAGATTGGTG CCATAGTCGC 2160 

TTCTTGGTCA GCTGGATCAT CTGACAATGA AGAAGTTCCG TTGATCGCTG AAGCATAAAC 2220 

CACTGGGAAA TCAAGCTGGT CGTCATCTGC ACCAAGCTCG ATGAAAAGTT CCAAGACTTC 22 8 0 

ATCCACTACT TCTGCTGGAC GAGCTGATGG CTTATCGATT TTGTTAACAA CCACGATTGG 234 0 

GACAAGGTCT TGTTCCAAGG CTTTTTTCAA TACGAAACGA GTTTGTGGCA TGGTTCCTTC 2400 

ATAGGCATCT ACGACCAAGA CAACACCGTC AACCATTTTC ATGATACGCT CAACTTCTCC 24 60 

ACCAAAGTCC GCGTGTCCTG GTGTGTCCAT AATGTTGATA CGAGTTCCGT TGTAAGCAAC 252 0 

GGCAGTATTT TTAGCAAGGA TGGTAATTCC ACGCTCTTTT TCGATATCGT TTGAGTCCAT 2 580 

AGCACGCTCT GCCAATTCAG TCCGTGCATC AAGCGTTTCT GATTGTTTCA ATAATTCGTC 2 64 0 

AACCAGGGTT GTTTTACCGT GGTCAACGTG GGCGATAATC GCAATGTTAC GGATATCTTC 2700 

TCTTAATTTT GTCATGATTT CCTCTATAAT ATTCAAAATT TATTTTCTAA CTGAACGATT 27 60 

ATACCATAAT TTCAAATAAA TAACATAACT CAAGCAAGTG TAAATGTTTT CACTCTGCTT 282 0 

TTCTTTTCAC GTCAAGCCTT TTCAAAGCGA GCGACTTATG ATAAGATAGG CACAGTATGC 2 88 0 

GTTTAGATAA TTTATTAGCT CAAGAAAAAA TCAGCCGAAA GGCCATGAAG CAAGCACTCC 2940 

TCAGAGGGGA AATTCTAGTC GATGGTTGCC CAGCCCGCTC CCTAGCTCAA AATATCGATA 3 000 

CAGGACTACA AGAACTCCTT TTTCAGGATC GAATCATTCA AGGCTATGAA CACACCTATC 3 060 

TTATGCTTCA TAAACCTGCT GGTGCCGTTA CAGCCAACAA AGACAAGGAA CTTCCGACCG 3120 

TCATGGACCT GCTTCCATCT AACATCCAGT CTGACAAGCT CTATGCCGTT GGCCGACTGG 3180 

ACCGAGATAC AACGGGACTC CTCCTCTTGA CCGATAACGG TCCCTTGGGC TTTCAGCTCC 3240 

TCCATCCCCA ATATCATGTC GATAAGACTT ACCAAGTTGA GGTTAATGGA CTTCTAACAC 3 300 

CTGACCATAT CCAAACCTTT CAAAAAGGAA TTGTCTTTTT AGATGACACT GTCTGTAAAC 3 360 

CCGCAAAACT AGAGATTCTA TCTGCAAGTC sCTCCCTCAG TCAAGCCTCT ATCACCATTT 3420 

CAGAAGGAAA ATTTCATCAA ATCAAGAAAA TGTTCCTCTC GGTTGGTGTT AAGGTGACTA 3 480 
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GCCTCAAAAG AATCCAATTT GGGGACTTCA CATTGAACCC AGATTTAGCA GAAGGTAACT 3 540 

ACCGCCCTTT GAACCAAAAA GAGTTACAAA TCATTAAAAA CTATTTAGAG ATGAGTCGAT 3 600 

AAAACAAAAA AAGCTTTAAA ACTAAAGCTT TTTTCTTTTA TTTACCGAAA AATTAAGGCG 3 660 

ATTGCTACAA TCCAGTTAAC TACAGAAATC ACAATTCCTA AGATATTAAG AATCTTTTCT 3720 

ATTTTATAGT CTAATTGTGA CTCTTTTTGG TATGAAATAG CCAAGACCAA TCCTATGATA 3 7 80 

CCCAAAATCA GGCCTACAAT TGGAAATAAC AAACCAAGAA TAATCGACAA GATACCCACA 3 84 0 

AAAAGTGGAT TTTTCTTCTT TTCTTTTATG TTCTAAGAAC TCCTTAAATT TTATACAAAT 3 9 00 

TAATTATACT ATAAAACAAT AGCTTCATCC TATCATTCGA CTAATTTGGA AATAAGGTTA 3 9 60 

GCTAGTCTTC ACTTTCCCTT TCCAAGAATC CAAGCCATAA GAAAGGATAT AAATCTCAGA 402 0 

AAAACCTTGT TTTTTCAAGT AAAGAGCTGC ATTTGTAACT CGTTGCGCAC GTTGGTTTTC 4 080 

GTAGAGAAGG ACAGGTTTAT CTTTACGAAG GGCTGCAAGA CTACTTTTCA ACTGACTTGA 414 0 

AGGAATATTG CGTGCACCAA GGATATGTTT TCTGTGGAAT TCTGCTGGGT CGCGCAAATC 42 00 

AATCAATTGA CCCGTACGAA TCAAGGCTTC AAACTCCTCA TTGTCCACAA TTTTAGCCGC 42 60 

ACGGCGAATA CGAAGATAGT TAAAGCCCAT CCACGCCAAC ATTGCTAGTA TAAGTGCCCA 43 2 0 

CAAAATCCAA GTAACCATTA GTTCTTTTCT CCATTTTTCT CAATATAATC CAATTCTACC 43 8 0 

TTGTGCTCTC TGCGAAGAAC TGCTTCTGCC TCTAGATAGT CTAATTTATC CATCAACCCT 4440 

GCATCGTAAA TCCGAGATAG TTCCAACTTC ATCAGTTCAA TATCATATAA GCGTTTTCCC 4500 

ATGTAAACAA TAATACCAAA TCGTTTGAGG AATTGCTGCA CATCATAGAA TGTTTTCATA 4560 

AGACTCATTC TAGCAAAATT TTGTGTTTTT TTCAAGAAGA GACTCACACA ATGCTCCTTA 4 620 

TTTTCCTATC TTCTTTAGCG ATTCTAAGGC AAGTATGGTA CAATAAAAAC ATGGGGATTC 4 680 

AACAATTACA TT 46 92 
(2) INFORMATION FOR SEQ ID NO: 221: 

(i) SEQtJHNGK CHAXAC'KK 1 ST.U'S : 

(A) LENGTH: 706 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 221: 
GCTAAAAAGC TGATAATCTT CGACTCCTGT ATATGATGTG TCTTTTCATG TAAGACACGC 60 

GCCGCCAGAA TCATGGCAAG AGCTGCAAGA CTGGCAAGTA AGAAGCCGAT AAGATAGGCA 12 0 

AAAAGATAAG TGAATTTGAC AAAGAAAGTC AAAAGAACTA GGAAACCAAA GCCTCCTCCA 180 
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AAAACTACCA AAGTCTTTCG TAAATCCCAG ATTTTATCCA ACTGCTTGAC GAGGGAAGTC 240 

GTCTGACGAA CGCCTACAAT AGTTGCTAAC ATACTTCCTA AAAAGAATGG ATAGACATGA 300 

GTTAAACTGG AGAAATAAAC AGAGGAATAA GAGGTCACTA GAAAACTACC AATAAACATG 360 

GAGAAGAAAC TGATCAAGAA GGCAACAGCA GATAAGAGAA AGACCATCCC CTTCAACTGA 420 

CCATTTGATT TAGCTTGTTT GGATAAGAAC CAAACTGCCA ATCCCCAAAG AATATAGTAG 480 

TGAACCTCAA CTGCCAAACT CCAATTATGA ACAAACAAAT GAGGAATGAA CTGAGATTCA 540 

TAACTCCCAC CTGTTAGGAG TTCATAGAAG TTGGTCATAA AGCCTAAGAC GCCCGCAATC 600 

TGGCCACCAA TTCCAGCAAC ATAGTCTTGG CGAACCAAGA AAGTAAAAGG CATGGTCACC 660 

AAGACCATCA AAACCACAGG TGGCACAATC TCGATAAAAG CGTCTT 706 
(2) INFORMATION FOR SEQ ID NO: 222: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3236 base pairs 

(B) TYPE : nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY : linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 222: 

CAGCTGATGG GCAATATCAG TCATAGAAAT TTTTTCAATT AACTTTTGAG CAATTTTTTG 60 

GTTGATGATA CGAGGGATTT GGTGATTTTT CTTTACCAGG GGAGTCTCAG CAACCATCAT 120 

TTTTGAACAG TGATAGCACT TGAAACGGCG TTTTCTAAGG AGAATTCTAG AAGGCATACC 1B0 

AGTTGTTTCG AGGTAAGGGA TCTTAGACGG TTTTTGAAAG TCATATTTCT TCATTAGACT 240 

TCCACAATCA GGGCAAGATG GAGCCTCATA ATCCAGCTTA GCGATAATTT CTTTGTGGGT 300 

ATCCATATTG ATGATATCTA GAATCTTGAT GTTTGGGTCT TTAATATCGA GCAGTTTTGT 3 60 

GATAAAATGT AATTGTTCCA TATGATTCTT TCTAATGAGT TGTTTTGTCG CTTTTCATTA 420 

TAGGTCATAT GGGACTTTTT TTCTACACAA AAATAAGCTC CATAATATCC ATAGGGGATT 4 80 

TACCCACTAC AAATATTATA GAGCCCGAAA ATATGGGAAA ACTGATCCTT GTTTCTGCTT 540 

TTGTCTATAG AAGAATAATA AAGATTATCT TCTTCAAATT CTCCGATATT CTCTAAAGTT 600 

TTGTGCAAGT TGCACAGAAC TTGTTTATTT TTTTGGTCAT CTTGCCATAG AAATATAAAG 660 

CGTTTTCATA TATAATATAA TTATCAAAAG ACAAAAGGAG TTCACCTCAT GGTAGAATTG 720 

AATCTTAAAA ATATTTACAA AAAATATCCA AACAGCGAAC ACTATTCAGT TGAAGATTTC 7 80 

AACTTGAACA TCAAAGATAA AGAATTTATC GTTTTCGTAG GACCTTCAGG ATGTGGTAAA 840 
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TCAACTACAC TCCGTATGAT TGCTGGTCTT GAAGACATTA CAGAAGGTAC TGCATCTATC 9 00 

GATGGCGTAG TTGTCAACGA CGTAGCTCCA AAAGACCGTG ATATCGCCAT GGTATTCCAA 960 

AACTACGCTC TTTACCCACA CATGACTGTT TATGACAACA TGGCTTTCGG TTTGAAATTG 1020 

CGTAAATACA GCAAAGAAGA CATTAACAAA CGTGTTCAAG AAGCAGCTGA AATACTTGGA 108 0 

TTGAAAGAAT TCTTGGAACG TAAACCAGCT GACCTTTCAG GTGGTCAACG TCAACGTGTT 1140 

GCCATGGGGC GTGCGATTGT CCGTGATGCG AAAGTATTCT TGATGGACGA ACCTTTGTCA 1200 

AACTTGGATG CCAAACTTCG TGTATCAATG CGTGCTGAAA TCGCTAAAAT TCACCGTCGT 12 60 

ATCGGAGCTA CAACTATCTA TGTAACTCAC GACCAAACAG AAGCGATGAC ACTTGCAGAC 132 0 

CGTATCGTTA TTATGTCAGC TACTAAGAAC CCTGCTGGTA CAGGTACTAT CGGACGTGTA 13 80 

GAACAAATCG GTACTCCTCA AGAAGTTTAC AAAAATCCAG TTAACAAATT CGTTGCAGGA 144 0 

TTCATCGGAA GCCCAGCTAT GAACTTCATC ACCGTGAAAT TGGTTGGTAG CGAAATTGTT 1500 

TCTGACGGTT TCCGTTTGAA AGTGCCAGAA GGAGCATTGA AAGTTCTTCG TGAAAAAGGC 1560 

TACGAAGGAA AAGAATTGAT CTTTGGTATC CGTCCAGAAG ACGTGAATGC AGAACCTGCT 162 0 

TTCCTTGAAA CATTCCCAGA CTGTGTTGTA AAAGCGACTA TCTCTGTATC AGAACTGCTT 1680 

GGTTCAGAAT CTCACCTTTA CTGTCAAGTT GGTAAAGACG AGTTTGTTGC AAAAGTTGAT 1740 

GCTCGTGACT ACTTGCAAAC AGGTGCAACA GTTGAGCTTG GATTTGACTT GAACAAAGCA 1800 

CACTTCTTCG ATGTAGAAAC TGAAAAAACA ATCTACTAAA ATAAATAAAA TTCAAAGCAC 1860 

TACAAGAAAA GATATCTCTT TATCAATTGT AGTGGAGAGA TATCAGTTAA TCTAGGGAGA 1920 

GAAACAAAAT GCTTCTCTCC TTTTTGCTAG AGAAGTCATA TTATGCATCT ATATTGTGAT 1980 

GCTCTTTAAT ACTCTTCGAA AATCTCTTCA AACCACGTCA ACGTCGCCTT GCCGTACGTA 2040 

TGATTACTGA TTTCGTCAGT TTTATCTGCA ACCTCAAAGA TGTACTTTGA GCAGCTTACG 2100 

GCTAGTTTCC TAGTTTGCTC TTTGATTTCC ATTGAGTATT ATTTGTGGGT ACCATCTACA 2160 

AGTGAAGCTA TATGCGTAAA CTACGTGAGC AATTGAATTC GAACTAGAGA GGTAATAATA 2220 

AATTTATGCT ATAGTTATGG TGACTTGTAT GCTTTTGATT CTAGTTTATC AAATAATAGA 2 2 80 

TTAGAATTGT CAGATAATAT CATTTTGTGT TATAATGAAG AAAAAACAGA GGTGTTCAAA 2 340 

TGTCAGAAGC AGGTCATAAG TTTTTAGCAA AATTGGGGAA AAAACGCTTA CGTCCAGGTG 2400 

GAAAGCGTGC CACAGATTGG TTAATTGCAG AAGGAGGATT TTCAAAAGAA AAGAGAATAC 2460 

TAGAGGTTGC GTGTAATAGG GGAACTACAG CAATTGAGTT GGCACAGCGT TTTGGTTGCA 2520 

AGATAACTGC TGTTGATATG GATGCTCAAG CTTTAGAAGT GGCTAAAAAA TCTGCTGGAA 2 580 

CGGCAGGTGT TGCTCATTTA ATCAGTTTTG AAAGAGCAAA TGCAATGAAA CTTCCTTATC 2 640 
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AAGATGCTAG TTTTGATATT GTTATAAATG AAGCTATGCT GACTATGCAA GCCGATCAAG 27 00 

CTAAGAAAAA ATGTGTAATG GAATATCTAA GGGTATTAAA ACCTGGAGGT CTTCTCTTGA 27 60 

CACATGATGT GCTTCTTAAG GAAGCTAAAG AGTCTATCAG ACAGGAATTA TCACAAGCAA 2 82 0 

TTCATGTAAA TGTAGGTCCT TTAACTCAAG ATGGTTGGGA ACAGGTGATG ATAGAATCAG 2 880 

GTTATTGTGA TGTGAAAGCA TTGACTGGTG AAATGACATT AATGAAATTA TCGGGTATGA 294 0 

TTTATGACGA AGGTTTGCTA GGAACTTTGA AAATTTGTGT AAATGCTTGT AAAAAGGAGA 3000 

ATAGAAAGCA GTTTTTAACT ATGTATAAAA TGTTTGCTAA GAATAAACAG AAATTGGGCT 3060 

TTATTGCGAT GGCTAGTTAT AAATCGTCAA AACGTTAGAT AATTATTGAA GTTAACTTTT 312 0 

CCTTTTTTCT TTCTTAAAAA ATATGCTATA ATAGAGAGTA AAAAACTTTG AAAGAAAGAA 3180 

AAAGATGAAT T T AAAAG AT T ACATTGCAAC AATTGAAAAT TATCCAAAGG GTACCG 323 6 
(2) INFORMATION FOR SEQ ID NO: 223: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 2885 base pairs 

(B) TYPE : nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 223: 
CCTGACTTTT CAAATTGGTT AGTTTGCCAC ACTTGGTTTA TATGCTCGTG GAAAGCATGG 60 

CTATTACTTC TCAAAGGGCG ATTTCTCACC CCATGAAAAG TGTCTATTTT TGTTTAGGTT 120 

TGTAAGTTAA TTCATTGTCA CATATTACTC TTTAACTGAT TGAGTGAGTA CCGCTTATAT 180 

TTGATGCCAA ACGCCTTAAA AGTGTTACCC TCAAGTCCTT TTAGAATACG GCTATAATTC 240 

CGCTCATTGT AAACTATCTT AAGCTCATCA CTATCTAGGT TGGTATTAAA AATGGTATTT 300 

TCACGATTGT TTAGCACGTC AAAGAGTAAA TCCTGCTCCC AGTCACTCTT AGGCTTAATA 3 60 

ACAGCATTTT TTGCTCCTAA ATCATCAATA ATTAAGTAAT CAACAGACTT CATGAGTTCA 42 0 

GTAGCTTCAA ACTCTGTAAG TGTTGCACCT TTACCATAAT TCCACCCCTC TTTAATTTGT 480 

TTGATCATTT CGGTTAGGCT TACAAAAAGC ACACTCTTAG GTTCTCCTTT TGTCTTATAC 540 

CCCTCATTTA TACCTTTGGC AATAGCAACT GATAAAAGTG TTTTTCCAAT CCCTGTACCT 600 

CCTGTGATAA GCGTATTTCC CCTCATGCCA TCAAGATATT TTTGTACCTG ACCTTTTGCA 660 

AATTCTAAAA ATCGCTTTTC TTCTGATGTT ACAGCATTAA AATCATCAAA AGTTTTAGTT 720 

TTAAACTCAT CTGCTACATA GCTCTTATTG CTCATCAACA CATTATAAGT TTGCATATAT 780 
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AGTTTAGCAT TCAAATTATC AGCAATCGCA TCTTCTTCAT CTTGCTTTTT CTGTTCTTCT 840 

TGGCATTGTT CACAATAGGG TGGGATACAG CGAACTTCTT TTATTGCCTC TCCGTTCTCA 900 

TTCCACCCCA CTACTACATG TCTTTCTCCT TTGATTTGTG TTAGCTGTAT TTCATGCTTA 960 

GGACACAATT CGTCTAGTTT AAATGTCTCA ATATTTCCTA AACTAGATTG TAATGATTTC 1020 

ATTTTCTGAC CTCCTAAAAT GGTTTTTCTT GTGTTGGTAT CCAATCTTCA TAGCTGGTAG 1080 

GCTCTAGTTG ATTGGTTTGC TGTTTTTTAG CCTCACGCGC TGCCCTGCTA TTTCTAACAA 1140 

GTTCCACCGT CAATAAATTG TCCTGTTTCC AACGGTTAAG GATTACCTTG ATGTATGCAA 12 00 

AGTTTGCTTT ACCCTGACTG ACAGCCTCTT TTAACGCCTC ATGGATAAGC TCTGGGCTAA 12 60 

AATCTTCTAG CATATACTGC AATTCTTGAA TCTGTAACGG TGACAATGCT TTACCTGTCT 1320 

CAGCTCGCTT CATATTCAAC AAGTCGTCTA TTTCCACACT GGTTACTTTT TTATTTACAA 13 80 

AATCAGAAAT CAGTTGAAAA ATGTTTGGAC TTTGTAGCTG GATTTCAGCC ATTACCTCAT 1440 

CAAATTCTGC TTGTGTCATG TTGTCTAAAT CTAGTGTCAT TGCATTGCCT CCTCAAACTT 1500 

CTCTATAAGA CAACTTTTAT TTGCTTTCTG AGTTCCATTT TTAGAGTTAA AAAGAATATC 15 60 

TTTTAAGGTT ACAGTAGCCT CTAAATACTC CTTTTCAGCA TGCTCTATAT ACGCCTGTTG 162 0 

CTCTGCTTCG TTCTCAAAAA AGTGCTTAGC TTGGCGTTTA AAGAATGCTT TTCGCATAGC 1680 

GTCCATTTCA AAAATACCAG GGGCGAAAAA CATTCCCGTA GTGCTTTTAG AGACCGCTTC 174 0 

GATTTTATGG CTTTCATTCA ATTCAGGAAG TTCAATCCAA AGTAAACGGG ACAACTCATC 1800 

TTTCATCGAT TTTGTCTGAC TTTCCAATAA AGAAAGGATT CTTAGGCCAT TTTCTTCGCT 1860 

AATTTCTCGC ATTTCTGCGC TAATTCTGTC TATACGTCTA GTTAAATTCT CATATGTTGT 192 0 

TTCTGTCATG TTTTTACCTC TGTTTCTTTG TTGGTGTGAT TTTTTAGCTT ATTTTTTTAC 19 80 

TTCTAAACAT CATTGTCTTA ATTTCCTGAT AACTCATTTT CAATTCAATC ATAGCTATTG 204 0 

CCATATCCTC AAATGCCTGG TACTGCTCCA ACTCCTCACT AGTCAAGCTA TCGATACCGT 2100 

TATAGCCCCC ACGCTCTTCT CTTAACTGCT TAGCGTTCAT GTCTGTTACT GCCTTTAGTA 2160 

GCAAGTTGTT CATGGTGCTA TGCGCGTGCT TTGGTGCATT AGGCCATGTT TCTATACTGT 22 2 0 

CATGCAAGGT TTTTCTTTTC GGTTTTTCTA GCGCCCTCTG CAGACGAATT TCAGAAAGTT 22 80 

CCTCACGCAT TTCAAAGAAT GCTTTGACTA GGTTTAGTTT GAATTGCCGT ACTGTTTCGG 2340 

TATTCTTTAA ATAAGTGATC AGAAAAGTAG CCTGTTGCTC GTTCAGAATA TAGGATTTTT 2400 

TAGGTTGTCC TCTAGTATCT AATTTATGGA TTTTAAATCC AAGTATTCCC AACTCTTCAA 24 60 

AGTCAGCCTT ATTTTCTCTT ATTAAGCGCG TGATAGTGTG GTGTTGTACT TCAGCACATT 252 0 

CAGCGATGAT CTCGCTTGTG GTGTACGGCT CTTTCTTACC GTCCATGTAA ACTAGTTCCA 2580 
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TTACGGTTCT ACCTCCTGTA TAAATCTGGT TAGCTTACTT TTTAATTGCC TCCTCTAGCC 2 640 

TCTTTTTTAG CCTCTAAAAC GGCTTTGGCT AGTGGTTAAT ATTATTTACC ACTTGTCTCT 2 700 

ATAAACGTGT TAGAGGCCTT TATAACGACT TGTATCGCTG TATCGATATC CTCCGTGGAA 2760 

TAGTAGATTT ATTTTCTAAT ATCATTCAAG ACTTGTTTAA CCCATTTCTT GAAAGAAATA 2820 

AAATTACATC TTCTTTATCC TTGGCATCTG CTTTGTCTGA GACAAATTAG AATGTCAATA 2 880 

CTTGG 2885 
(2) INFORMATION FOR SEQ ID NO: 224: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3144 base pairs 

(B) TYPE : nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 224: 
TATCAATCCT TTCCCATTAT AGGAGCAACA GAGTGGGAGT AGTCATCTAA GGACTAATTT 60 
ATGTATTTTT ACGAGTCAGT ATCTTGGGAT ACTGGTTTTT ACTTTTCTAG ACTTTTTGAC 12 0 

TACTTGTTAA AACTGGGATA ATTTTCGACT GTTTAACAGT TATTATGCAA AGTCTAAAAG 180 
ATTAGAATTG TCAAAACAAT CCGTCTAGGC TTGATTTTAT CCTTTATTTi 
AGAAGGAAAA ATGTCAAACT TTTATATTGC AAATAGGAGA AATCATGACA £ 
AACGTCCTGA GGTTTTATCA CCTGCAGGGA CTTTAGAGAA GCTAAAGGTA GCTGTTCAGT 
ATGGAGCAGA TGCTGTCTTT ATCGGTGGTC AGGCCTATGG TCTTCGTAGC CGTGCGGGAA 
ACTTTACTTT CGAACAGATG G AAGAAGG CG TGCAGTTTGC GGCCAAGTAT GGTGCCAAGG 
TCTATGTAGC GGCTAATATG GTTATGCACG AAGGAAATGA AGCTGGTGCT GGTGAGTGGT 
TCCGTAAACT GCGTGATATC GGGATTGCAG CAGTTATCGT ATCTGACCCA GCCTTGATTA 
TGATTGCAGT GACTGAAGCA CCAGGCCTTG AAATCCACCT TTCTACCCAA GCCAGTGCCA 
CTAACTATGA AACCCTTGAG TTCTGGAAAG AGCTAGGCTT GACTCGTGTC GTTTTAGCGC 
GTGAGGTTTC AATGGAAGAA TTAGCTGAGA TCCGCAAACG TACAGATGTT GAAATTGAAG 
CCTTTGTCCA TGGAGCTATG TGTATTTCAT ACTCTGGACG TTGTACTCTT TCAAACCACA 
TGAGTATGCG TGATGCCAAC CGTGGTGGAT GTTCTCAGTC ATGCCGTTGG AAATACGACC 
TTTACGATAT GCCATTTGGG AAAGAACGTA AGAGTTTGCA GGGTGAGATT CCAGAAGAAT 
TTTCAATGTC AGCCGTTGAy ATGTCTATGA TTGACCACAT TCcAGATATG I 
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GTGTGGACAG TCTAAAAATC GAAGGACGTA TGrAGTCTAT TCACTAyGTA TCAACAGTAA 1080 

CCAACTGCTA CAAGGCGGCT GTGGATGCCT ATCTTGAAAG TCCTGAAAAG TTTGAAGCTA 1140 

TCAAACAAGA CTTGGTGGAC GAGATGTGGA AGGTTGCCCA ACGTGAACTG GCTACAGGAT 1200 

TTTACTATGG TACACCATCT GAAAATGAGC AGTTGTTTGG TGCTCGTCGT AAAATCCCTG 12 60 

AGTACAAGTT TGTCGCTGAA GTGGTTTCTT ATGATGATGC GGCACAAACA GCAACTATTC 13 20 

GTCAACGAAA CGTCATTAAC GAAGGGGACC AAGTTGAGTT TTATGGTCCA GGTTTCCGTC 1380 

ATTTTGAAAC CTATATTGAA GATTTGCATG ATGCTAAAGG CAATAAAATC GACCGCGCTC 1440 

CAAATCCAAT GGAACTATTG ACTATTAAAG TCCCACAACC TGTTCAATCA GGAGACATGG 1500 

TTCGAGCTCT TAAAGAGGGG CTTATCAATC TTTATAAGGA AGATGGAACC AGCGTCACAG 15 60 

TTCGTGCTTA ATGTAGTTGT TTAGTTTTAA AAAACTATGC AAAGCTCCAT ATACAACACT 1620 

TAAACGAGAT TAAAGAATGG CGAAATCCCT TGATGCGCAA GAGATTAGCT GTCTTTTTTA 16 80 

TTTTTTAAGT GATAAAGTCG GAGTTTAGGC ATCAAAGCCT ATCAAATTAA ACAAAGAAGC 1740 

GATGTCTTAG ATATTTTGAA AAAAATTAAT AAGCAGAAAA CTCTCTATTA TTTTGTTGTA 1800 

GAGAGTTTTT TGTTAATAAA ATTTCACAAA ATGACATTTA TATATTGCAT TAAGTTAGAT 18 50 

ATATGATATA ATATTGTTAA AAAGAGGCGC AACTTTTTAA AATTAATGAG AATCAAAGAG 1920 

AAAACCAATA ATATTAATGG AGGAATAAAA AATGTAAGTA AGCATTATGG TCATTCAATC 19 80 

ATTCTCAAAG ATATAAATTT TGCACTTAAC AAGGGTGAAA TTGTTGGTCT AGCAGGGAGA 2 040 

AATGGAGTTG GTAAGAGTAC GTTGATGAAA ATTCTTGTTC AG AATAAT C A ACCGACTTCA 2100 

GGTAATATTA TAAGCAGTGA TAATGTTGGG TATTTAATCG AAGAACCAAA ATTATTTTTA 2160 

TCTAAAACAG GTTTAGAGAA TTTAAAATAT TTGTCAAATT TATATGGTGT TGACTACAAT 222 0 

CAAGAAAGAT TTAGATGTTT GATCCAAGAG TTAGATTTGA CTCAGTCTAT TAATAAAAAA 22 80 

GTAAAGACCT ATTCTTTGGG TACAAAACAA AAATTAGCTT TGCTTCTAAC TCTCGTTACG 2 34 0 

GAACCTGATA TATTGATTTT AGATGAACCG ACTAATGGTT TAGATATTGA ATCATCACAA 24 00 

ATAGTTTTAG CGGTTCTAAA AAAATTAGCT TTACATGAAA ATGTGGGAAT TTTAATATCG 24 6 0 

AGTCATAAAT TAGAAGACAT TGAAGAAATT TGTGAGAGAG TTCTTTTCTT GGAGAACGGG 2520 

CTTTTGACAT TTCAAAAAGT AGGAAAAGAT AGTCATAATT TCTTGTTTGA GATAGCTTTT 2580 

TCATCAGCTA CAGATAGAGA CATTTTCATT ACCAAACAAG AATTTTGGGA TATTGTTTAG 2 640 

GAAGAGGGAT TGAGAATTAC TATGTCTGGG AATATTCAAA ATAGTGAGCT TTTTAAATTT 27 00 

TTTAACGAAA ACTCTATTAA AGTAGTTGAT TTTGAAACTA AAAAAGAGAC GCTTAAAGAT 27 60 

ATTTACCTAA ATCGTTCAAA ATAAAGGAAG GTTATAATCA TGAAATTAAA TAAACAGAAG 282 0 
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AATCGGATGA TTTACGTCTT GTCTAATTTT CTATATGCTA TCTCAGTTTC CATTATTTAT 2 8 80 

GCTTTGAATG GCATTGTGTT ACTAGTCATA GTAAGTAAAT TGGGTATTCC AGGTGATTTA 2 940 

GGATTAAATT TTATAGTAGC TATTGTAGTC AATACAATTT TGTTAGTCCT GTTTTATTTT 3 000 

CTATTATCTT ACATTTTCTA TTTATACAAA TTGAAAAGTG GCTTGGTATw TGGTATTTTA 3060 

GTAGCTTTAC TACTCTTTAT CTCTAATATA TTAAATACGA TGATGATGAA TACTAGTAAT 3120 

GATTTGTTTA TCAAAGCAAT TGAA 3144 
(2) INFORMATION FOR SEQ ID NO: 225: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 37S6 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY : linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 225: 

TACGGTATTA TTTTTAAGGA GAAAGAATCA TGAAAATCAA AAAATGGCTT GGTCTAGCAG 60 

CCCTTGCTAC AGTCGCAGGT TTGGCTCTTG CAGCTTGCGG AAACTCAGAA AAGAAAGCAG 120 

ACAATGCAAC AACTATCAAA ATCGCAACTG TTAACCGTAG CGGTTCTGAA GAAAAACGTT 180 

GGGACAAAAT CCAAGAATTG GTTAAAAAAG ACGGAATTAC CTTGGAATTT ACAGAGTTCA 240 

CAGACTACTC ACAACCAAAC AAAGCAACTG CTGATGGCGA AGTAGATTTG AACGCTTTCC 3 00 

AACACTATAA CTTCTTGAAC AACTGGAACA AAGAAAACGG AAAAGACCTT GTAGCGATTG 3 60 

CAGATACTTA CATCTCTCCA ATCCGCCTTT ACTCAGGTTT GAATGGAAGT GCCAACAAGT 420 

ACACTAAAGT AGAAGACATC CCAGCAAACG GAGAAATCGC TGTACCGAAT GACGCTACAA 4 80 

ACGAAAGCCG TGCGCTTTAT TTGCTTCAAT CAGCTGGCTT GATTAAATTG GATGTTTCTG 54 0 

GAACTGCTCT TGCAACAGTT GCCAACATCA AAGAAAATCC AAAGAACTTG AAAATCACTG 600 

AATTGGACGC TAGCCAAACA GCTCGTTCAT TGTCATCAGT TGACGCTGCC GTTGTAAACA 660 

ATACCTTCGT TACAGAAGCA AAATTGGACT ACAAGAAATC ACTTTTCAAA GAACAAGCTG 72 0 

ATGAAAACTC AAAACAATGG TACAACATCA TTGTTGCAAA AAAAGATTGG GAAACATCAC 780 

CTAAGGCTGA TGCTATCAAG AAAGTAATCG CAGCTTACCA CACAGATGAC GTGAAAAAAG 840 

TTATCGAAGA ATCATCAGAT GGTTTGGATC AACCAGTTTG GTAATAAGAA ACAGGGAGGT 900 

GGGAGAGAAA ATTCCACCTC TTGCTTTTGT ATAGAGTATA GATTGTAAAG AAGACTATTC 9 60 

GTTCATAGAA AGGTAGAGAG AATATGGTTT TTCCTAGCGA ACAAGAACAG ATTGAAAAAT 1020 
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TTGAAAAGGA TCATGTAGCC CAGCATTATT TTGAGGTTTT GCGTACCTTG ATTTCTAAGA 108 0 

AATCAGTCTT TGCCCAGCAG GTTGGACTCA AGGAAGTCGC AAATTATCTG GGTGAGATTT 1140 

TCAAGCGTGT TGGAGCTGAA GTGGAGATTG ATGAGAGCTA TACAGCGCCC TTTGTCATGG 12 00 

C AC AT T T C AA GAGTTCGCGT CCAGATGCCA AGACCTTGAT TTTCTATAAC CACTATGACA 12 60 

CTGTGCCAGC GGATGGGGAT CAGGTCTGGA CAGAGGATCC kTTTACGCTT TCGGTCCGCA 132 0 

ATGGCTTCAT GTATGGGCGT GGGGTTGATG ACGACAAGGG TCATATCACA GCTCGCTTGA 13 80 

GTGCTTTGAG AAAATATATG CAGCACCATG ATGATTTACC TGTCAATATC AGCTTTATCA 1440 

TGGAGGGAGC GGAGGAATCG GCTTCAACAG ACCTAGATAA GTATTTGGAA AAGCATGCAG 1500 

ACAAACTCCG TGGGGCGGAT TTGTTGGTCT GGGAACAAGG GACCAAAAAT GCCTTGGAAC 1560 

AGCTGGAAAT TTCTGGTGGC AATAAGGGGA TTGTGACCTT TGATGCCAAG GTAAAAAGCG 162 0 

CTGATGTGGA TATCCACTCG AGTTATGGTG GTGTTGTGGA ATCAGCTCCT TGGTATCTCC 16 80 

TCCAAGCCTT ACAGTCTCTT CGTGCTGCGG ATGGCCGTAT CTTGGTTGAA GGCTTGTACG 1740 

AAGAAGTACA AGAGCCCAAT GAACGAGAAA TGGCCTTGCT AGAAACTTAT GGTCAACGAA 1800 

ACCCAGAGGA AGTTAGTCGG ATTTATGGAT TGGAGTTGCC TCTCTTACAG GAGGAGCGGA 18 60 

TGGCCTTTCT AAAACGTTTC TTTTTCGATC CAGCGCTTAA TATCGAAGGA ATCCAGTCTG 192 0 

GTTATCAAGG TCAGGGTGTT AAGACTATTT TACCTGCAGA AGCCAGTGCC AAGCTAGAGG 1980 

TTCGTCTGGT TCCGGGCCTA GAACCGCATG ATGTTCTGGA AAAAATTCGG AAACAGCTAG 2040 

ACAAAAATGG CTTTGATAAG GTAGAATTAT ACTATACCTT GGGAGAGATG AGCTATCGAA 2100 

GCGATATGAG CGCACCAGCC ATTCTCAATG TGATCGAGTT GGCCAAGAAA TTCTATCCAC 2160 

AGGGCGTTTC AGTCTTGCCG ACGACAGCGG GGACAGGACC TATGCATACG GTCTTTGATG 222 0 

CCCTAGAGGT ACCAATGGTT GCATTCGGTC TAGGAAATGC CAATAGCCGA GACCACGGTG 2280 

GAGATGAAAA TGTGCGAATC GCTGATTATT ACACCCATAT CGAATTAGTA GAGGAGCTGA 2340 

TTAGAAGCTA TGAGTAGAGA TATTATCAAG TTAGATCAGA TCGATGTGAC TTTTCACCAA 2400 

AAGAAGAGAA CCATCACAGC GGTTAAGGAT GTGACCATTC ACATCCAAGA AGGGGATATC 24 60 

TACGGAATCG TTGGATATTC TGGAGCAGGA AAATCAACCC TTGTACGGGT GATTAATCTC 252 0 

TTGCAAAAAC CATCTGCAGG GAAAATTACC ATTGACGACG ATGTGATTTT TGACGGCAAG 2580 

GTGACCTTGA CGGCAGAGCA GTTGCGTCGT AAACGTCAAG ATATCGGAAT GATTTTCCAG 2640 

CATTTTAACC TGATGAGCCA AAAGACAGCA GAGGAGAATG TAGCCTTTGC CCTTAAACAC 2700 

TCTGAACTCA GCAAGGAAGA AAAGAAGGCT AAAGTAGCTA AGTTGTTGGA CTTGGTTGGT 27 60 

TTGGCAGATC GTGCTGAAAA CTACCCTTCA CAACTATCTG GAGGGCAAAA ACAGCGTGTG 282 0 
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GCAATTGCGC GTGCCTTGGC CAATGATCCA AAAATCTTGA TTTCAGACGA GTCAACTTCT 2880 

GCCCTTGATC CGAAGACAAC CAAGCAGATT TTGGCCTTGT TGCAAGATTT GAACCAAAAA 294 0 

TTAGGCTTGA CTGTTGTCTT GATTACGCAT GAAATGCAGA TTGTCAAAGA CATTGCCAAC 3 000 

CGTGTTGCAG TTATGCAGGA TGGGCATTTG ATTGAAGAGG GTAGTGTGCT TGAAATCTTC 3060 

TCAAACCCTA AACAACCTTT GACTCAAGAC TTTATCTCAA CAGCTACAGG TATTGACGAA 312 0 

GCCATGGTCA AAATCGAGAA GCAAGAAATC GTGGAACACT TGTCTGAAAA CAGTCTCTTG 3180 

GTGCAACTCA AGTACGCTGG AGCTTCAACA GACGAGCCAC TTTTGAATGA ATTGTACAAG 324 0 

CATTACCAAG TAATGGCTAA TATTCTCTAT GGGAATATCG AAATTCTCGA TGGTACTCCT 3 3 00 

GTTGGAGAAT TGGTGGTGGT TTTGTCAGGT GAAAAAGCAG CGTTGGCAGG TGCCCAAGAA 3360 

GCCATTCGTC AAGCAGGTGT ACAACTAAAA GTATTGAAGG GAGTACAGTA AGATGGAATC 342 0 

ATTGATTCAA ACCTATTTAC CAAATGTCTA TAAGATGGGT TGGGCTGGTC AGGCAGGCTG 3480 

GGGAACGGCT ATCTACTTAA CTCTTTATAT GACAGTTCTT TCCTTCATTA TCGGAGGCTT 354 0 

CTTGGGGCTA GTGGCAGGTC TCTTTCTCGT CTTGACAGCG CCAGGTGGTG TCTTGGAGAA 3600 

TAAAGTCGTA TTCTGGATTT TAGACAAAAT TACCTCAATT TTTCGTGCGG TTCCCTTTAT 3660 

CATCCTCTTG GCAATCTTGT CACCACTTTC TCACTTGATT GTTAAAACAA GTATCGGGCC 3720 

AAATGCAGCC CTTGTCCCAC TTTCTTTTGC AGTCTTTGCC TTCTGG 3766 
(21 INFORMATION FOR SEQ ID NO: 226: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2520 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION : SEQ ID NO: 226: 

TGTTGCTGAG TTAATCGGTA CGTTCATGTT TGTATTCGTC GGGACAGGAG CTGTTGTTTT 60 

TGGAAATGGT CTTGATGGCC TTGGTCACCT TGGAATCGCC TTTGCCTTTG GTTTGGCAAT 120 

CGTGGTGGCA GCCTACTCAA TCGGAACTGT TTCAGGTGCT CACTTGAACC CGGCTGTTTC 180 

GATTGCTATG TTTGTAAACA AACGTTTGTC ATCTTCAGAA CTTGTAAACT ACATCCTTGG 240 

TCAGGTTGTT GGAGCTTTCA TCGCTTCTGG CGCTGTCTTC TTCCTCTTGG CTAACTCAGG 300 

TATGTCAACT GCTAGTCTTG GTGAAAATGC CTTGGCAAAC GGTGTCACTG TCTTTGGTGG 3 60 

TTTCTTGTTT GAAGTCATCG CAACTTTCTT GTTTGTATTG GTTATCATGA CTGTGACTTC 420 
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AGAAAGCAAG GGCAATGGCG CGATTGCTGG TTTGGTAATC GGTTTGTCAT TGATGGCGAT 4 80 

GATTCTTGTC GGATTGAAGA TTACTGGACT TTCAGTAAAC CCAGCTCGTA GCTTGGCACC 540 

AGCTGTCTTG GTAGGCGGCG CAsCCTTCAA CAAGTTTGGA TTTTCATCCT TGCACCAATC 600 

GCTGGTGGAG TTCTTGCAGC CCTTGTTGCA AAAAATTTCC TTGGAACAGA AGAATAATTG 66 0 

AAACTCAAAA AGCCTTGCTC CTCATCTTGA GGAACAGGGC TTTTTCGTAT GATACTCTTC 720 

GAAAATCTCT TCAAACCACG TCAGCTTCAT CTTGCCGTAG TATGGTTACT GACTTCGTCA 780 

GTTCTATCCA CAACCTCAAA ACAGTGTTTT GATCTGACTT CGTCAGTTCT ATCTGCAACC 84 0 

TCAAAACAGT GTTTTAAGCT GACTTCGTCA GTTCTATCTG CAACCTCAAA ACAGTGTTTT 900 

AAGCTGACTT CGTCAGTTCT ATCTGCAACC TCAAAACAGT GTTTTAAGCT GACTTCGTCA 960 

GTTCTATCTG CAACCTCAAA ACAGTGTTTT AAGCTGACTT CGTCAGTTCT ATCCACAACC 1020 

TCAAAACAGT GTTTTGATCT GACTTCGTCA GTTCTATCCA CAACCTCAAA ACAGTGTTTT 108 0 

GATCTGACTT CGTCAGTTCT ATCCACAACC TCAAAACAGT GCTTTGAGCA ACcTGCGGCT 1140 

AACTTCCTAG TTTGCTCTTT GATTTTCATT GAGTATGACT TTAGCGGTTG TCAATTTTCT 1200 

CTGGATAAAG GTCGTGTTGG AAGAGGCGTT GTTCTGCCAA GCCCTCATAC TTAGTTCCTT 12 60 

GCTTACCGTA GTTGTAGTAG GGGTCGATTG AAATGCCACC GCGCGGAGTG AATTTTCCCC 1320 

AGACTTCTAA ATAGCGAGGG TCTAGCAAGT TGACCAAGTC TTTCCCGATG GTGTTGATAC 13 8 0 

AGTTTTCGTG GAAATCTCCG TGGTTTCGGT AGCTAAATAG ATATAGTTTG AGGGATTTTG 1440 

ACTCGACACA GAGCTTGTCA GGAATGTAGG AAATATGAAT CGTCGCAAAG TCTGGCTGAG 1500 

CAGTGATTTG TCCCAGCAGA GACATATCGA GGATATGGTG ACGAATGCCC TGTTCCTTAG 1560 

CGATTTCTCT AGTAATTTGA ATTTCGAGGT GATGACGTTG GCCGTAGGCA AAGGTGACAG 162 0 

CTTCGACTGT TTCATAGTGT TGCATGACCC AGAAAAGGCA GGTTGTTGAA TCTTGACCAC 1680 

CACTAAAGAC GACCAAGGCT AATTGACGTT TCATAGTACT CCTTCCAAAA TGGGAAATGT 1740 

TCAGAGCACG CAAAAAGCTC CCATTAGGGA GCTAAAAAAT ACCAAATCGA GGTTTTTTTA 1800 

GCGATGGCAT ATCCCAAACA TCGTAATATT CTACTTATAT AGTAAAATGA AATAAGAACA 1860 

GGACAAATCG ATCAGGACAG TCAAATCGAT TTCTAACAAT GTTTTAGAAG TAGAGGTGTA 1920 

CTATTCTAGT TTCAATCTAC TATAGTCTAG CATATTTTTT GAAAAATGGC AAAGGGCAAG 1980 

AAAAAAGAGA CCAAAGAAAG TACTTGGTCT CTCGTTTGAT TAGCTCAATT CAGCAATGAT 2040 

GGCCTTGATT TGTTCTGCTG TGTGAACACC TGCAACTTGT TTGACAACTT GGCCGTCTTT 2100 

TTTGAAGAGA AGAGTTGGAA TAGACATGAT TCCAAAAGCA CGAGCTGTGT TTGGATTTTC 2160 

ATCAACGTCC ATTTTAACGA TTTTCAAGAC ATCTTCTGAA AGTTCTTCAG ACAATTTGTC 2220 
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CAAGATTGGA CCTTGCATAC GACATGGACC ACACCAAGTT GCCCAGAAGT CTACTAAGAC 22 80 

CAAACCGTCT TTTGTTTCTT GTTCGAATGT TGCATCTGTA ATTGCTTTTG CCATTGTATT 23 4 0 

TCTCCTTTTT TTAGTTATAT TGGCTTAAAT CTTGTTTCAT GAGATAGAAG AAGATATCTC 24 00 

CATAAGTCCC ATGGTAGTCC AAATTATGAC CCTTGTAAGT TAATTTTTGG ACAGGGTAGT 24 6 0 

AkkCTGCGAC GCCGATAAGG CAAGCTTGTT GCGAACGTTC AAAGTCTTCA TAAGACTCGG 252 0 
(2) INFORMATION FOR SEQ ID NO : 227: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 5278 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 227: 

ACTCAGTTAG ATTTTGTTTT CAAAAACAAC GAAGAAAAAG ACCATGTTGC TCTACTTGGA 60 

AGAATTGGCT CCGAACGTGT TTATCGATAT ATTAATAAAA AATATTTAGA TTTACCGGAA 12 0 

ACATTCGAAA ATTATAATGT TTTTGTACCA GAAGCTAATG GAAGTGGTGC CTTAGGTGAA 180 

GTCTTATCAA CACCCCTAAT CGGGGAACCC CTAATCGGGC ATACAGATAC TTTTTTATCT 240 

ATTGGTAATT TTAAAACAAA ATTTGAAGCC GATGCTTGTA TTAAATTTAT TAAAACTAAA 300 

TTCGCTAGAG TATTATTAGG TGTTTTGAAA GTTACTCAGC ATAATTCACG CAAAACTTGG 3 60 

TATTACGTCC CCCTCCAAGA CTTTACGGTC AATTCGGACA TTGATTGGAC ACAATCAGTG 420 

ACTGATATTG ACCGCCAGCT TGATCAAAAA TATGACTTTT CCCCTGAAGA AATTGCCTTT 480 

ATTGAGAATC ATGTAAGGGA GATGGATTAG AAAAGTATTT TTATTTGACA AATAGTGCTC 540 

AATGATCTAA AATGACTATA TAGGATTAGG TCAGGAAGCA TACGATGCCC TGACCCTTTT 600 

TGTACTTATG AGATGAGAAA GTCATTTGTT AGATAAATTG ACTCGTTAGC AAACGTTCAA 660 

AAAAGGAAAA CTTATGCCAG TAGAAATTAA AACCACTAAA GAAATTCATC CTAAAATCTA 720 

TGCCTACACC ACACCGACAG TAACCAGTAA TGAAGGCTGG ATTAAGATTG GGTATACAGA 7 80 

ACGTGATGTC ACACAACGTA TCAAGGAGCA AACGCATACA GCTCATATAG CTACAGATGT 840 

CTTATGGACT GGTGATGCAG CTTATACAGA AGAGCCTGAT AAGGGGAAAA CTTTCAAGGA 900 

CCATGATTTC CACCATTTCC TTTCTTTCCA TGATGTAGAA CGTCGTCCCA AGACGGAATG 9 60 

GTTCTATTTT AATGGAACTC CTGAAAAATC AAAAAATCTT TTTGATAAGT TTGTTCAGCA 1020 

TGATTTGTCT GGTTATCAGC CTGGAAAAGG ACAGGACTAT ACTCTGCGAC AAGAGCAAGA 1080 
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AGAAGCAGTT GCTAAGACAT TAGCTTATTT CCAAGAACAT GCTGGAGGCA AGTTTCTCTG 1140 

GAATGCCAAG CCACGCTTTG GTAAAACCTT GTCTACCTAT GACCTAGCTC GACGGATGGA 12 00 

AGCTGTCAAT GTCCTAATTG TAACAAACCG CCCTGCCATT GCTAACTCAT GGTATGATGA 1260 

TTTTGAAACA TTCATAGCAG GTCAAACGAC TTACAAGTTT GTTTCTGAAT CAGATAGCCT 13 2 0 

TAAGAGTCGT CCAATCTTGT CACGACAAGA ATTTCTTGGT ATTTTAGCTG ACGATGTAAG 13 80 

ACAACTTGCT TTTATCAGTC TCCAAGACTT GAAAGGATCT GTTTATTTAG GTGGAGAGCA 144 0 

CGATAAACTC AAATGGGTAA CTGATCTGCA TTGGGACTTG TTGGTTATTG ACGAGGCTCA 1500 

TGAAGGAGTT GATACCTTCA AGACTGACCA AGCCTTTAAT AAGATTCGAC GAAATTTTAC 1560 

TCTGCATTTG TCAGGTACAT CATTTAAAGC ATTGGCTAAA GGAGATTTTA CAGAGGAACA 162 0 

AATCTACAAC TGGTCTTATG CTGATGAGCA GGCTGCTAAG TATTCGTGGT CTCTTGAGCA 168 0 

AGAAGAGGAA AATCCTTATG AAAGCTTGCC TCAGTTGAAT CTCTTTACCT ATCAAATGTC 17 40 

TCAGATGATT GGCGAAAAGT TAGAAAAAGG CGCTCAGATC GATGGTGAAA ATATTGACTA 1800 

TGTTTTTGAC TTAAGTGAAT TTTTCGCTAC AGATGATAAA GGGAAATTTA TTCATGAGCA 18 60 

TGATGTCAGA AATTGGTTAG ATACTCTATC AAGCAATGAA AAATATCCAT TTTCAACCAA 192 0 

AGAACTCCGT AATGAACTCA AGCATACTTT TTGGCTTTTA GAACGTGTCG CTTCGGCCAA 198 0 

AGCATTAAAA GCCCTACTAG AAGAACACCC AATCTATGAA AACTATGAGA TCGTTCTAGC 204 0 

TGCTGGTGAC GGACGTATGT CCGAAGAAGA CGATAAAGTC AAACTCAAAT CCTTGGACTT 2100 

GGTTAGAAAA GCGATAGCAG AGAATGACAA AACCATTACC CTATCCGTTG GTCAGCTGAC 2160 

GACAGGTGTC ACTATCCCTG AATGGACAGG TGTATTGATG TTATCAAATT TGAAATCACC 222 0 

AGCTCTTTAT ATGCAGGCCG CCTTCCGTGC TCAAAATCCT TACTCATGGA GCGATAACAA 2280 

AGGAAATCAC TTTCGCAAAG AAAGAGCCTA TGTATTTGAC TTTGCGCCGG AAAGAACCTT 234 0 

GATTCTCTTT GATGAGTTTG CCAACAACTT ATTGCTTGTA ACTGCAGCTG GTAGAGGAAC 2400 

TTCAGCTACA CGCGAAGAAA ATATTAGAGA ATTATTAAAC TTCTTTCCAA TTATTGCCGA 2460 

AGACCGTGCT GGTAAGATGG TTGAAATTGA TGCAAAGGCA GTTCTAACCA CTCCTCGCCA 2 52 0 

GATAAAAGCT AGAGAAGTTC TTAAACGAGG TTTTATGTCC AATCTCTTAT TTGATAATAT 2 580 

TAGTGGTATT TTCCAAGCAA GTCAAACAGT TTTAGATATT TTAAATGAGC TGCCAGTTGA 2 640 

AAAGGAAGGG AAGGTACAAG ATAGTTCTGA TTTATTAGAT TTTTCAGATG TTACAGTCGA 2700 

TGATGAGGGA AATGCAGTAG TAGACCATGA AATTGTAGTT AATCAGCAAA TGCGACTTTT 27 60 

TGGTGAAAAA GTTTATGGAC TTGGTGAATC TGTTGCTGAG TTAGTCACAA AAGATGAGGA 282 0 

ACGAACTCAA AAACAGCTGG TCAATGACTT GAGTAAGACC GTTTCTTCAG TGATTGTAGA 2 880 
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GGAATTGAAA GCAGATTATT CTCTAAAAAC AAGGGAAACT GAGCAAATTA AGAAACAAAT 2 940 

TACAGCAACA CTTGAGAATG AAATTCGAAA AAATGATATC GAAAGAAAAA TTTCTGAAGC 3000 

TCATATCAAG CAAGAGTTGC AACAGCAGCT CAAAGAAGCA AATGATAAAG CGCAAAAAGA 3060 

TAAGATTCAA GAAGATTTGG AAAAACGTTT AGAAGAAAAT AAACTCATTC ATAAAGAAAA 312 0 

ACTAGAACAA ACACTCAAAA AAGAAGTGGA AAAAATGCCT GAGAAATTTA TCGAACAGGT 3180 

T G AG AT AAAA CGTGTGGAAC AGTTGAAACA ATCAGCTCAA GATGAAATTC GTGACCATTT 324 0 

ACGAGGGTTT GCAAGAACAA TTCCAAGTTT TATTATGGCT TACGGTGATC AAACTCTAAC 3300 

ACTTGATAAT TTTGATGCCT TTGTTCCTGA ACATGTTTTT TATGAAGTAA CAGGGATTAC 33 60 

GATTGATCAG TTTAGATATT TGCGAGATGG TGGGCAGGAT TTTGCAGGGC ATCTCTTTGA 342 0 

TAAAGCAACA TTTGACGAAG CTATTCAAGA ATTTCTTCGC AAGAAAAAGG AGTTGGCGGA 34 80 

TTATTTTAAA GATCAAAAAG AAGACATTTT TGACTATATT CCACCGCAGA AGACCAACCA 3 54 0 

AATTTTCACT CCTAAACGAG TGGTGAAAAG GATGGTAGAT GATTTGGAAA AGGAAAATCC 3 600 

AGGGATTTTT GATGATCCAT CTAAGACTTT TATTGATTTA TATATGAAGT CAGGCCTCTA 3 6 60 

TATTGCAGAA CTTGTGAAGC GGTTATATAA TAGCAATGGC TTGAAAGAGG CCTTTCCAAA 372 0 

TCCTGAAGAA CGCTTAAAAC ATATTTTGGA AAAGCAAGTT TATGGATTTG CTCCGTCTGA 37 8 0 

GATTATCTAT AACATTTCCA CTAATTTTAT ATTTGGCAAT CTTTCTAAAG ATATCAGTAG 3 84 0 

GAAGAATTTT GTTTTAGCAG ATACCATTCC AGCGGCTAAA GAAGGGAGCA TTCAAAAGTT 3900 

GGTTGATTCC TATTTTGAAA ATAATTAAAA AGAAGGCCGA GTCAAAATTC TTTGAAATCA 39 60 

GAAAAAACGC ATAATATTGA GTGCTTTTGT ACTGCCCCCC AAAAGTTAGA CAGAAAAAAT 4 02 0 

CTAACTTTTG GGGGGCAGTT CAGACAATCC TTGGTATTAT GCGTTTTATT GTGGGAAGAT 4080 

GTATAATGGA TTGAAATAAG ATATGAACAA ATCAATTAGG AATTTAAAGC ATTTTATAAC 414 0 

AACGTTTTAG AGTAATGGGG GGCTATTTCA ACTTCAACCT ACTATAATAC AGAAAAAAAC 4200 

AACTCCCTGA TAATTCAAGG AGTTGTCTAT AGTTAAATTA GTTTTTAGAA GCTTCTTGGA 42 60 

ATTCTGGGTT TTTCCATGCT TCGTCAATGA TAGCTTGTAA TTCTTTAGCA GATGCTTGCA 432 0 

TTTTTTGAGT TTCTGCGTCG TTCAATGGGA TATTTACTGG ACGAACGATA CCATGTGCAC 4380 

CAACAACAGC TGGTTGACCG ATAAAGACAT TCTCAACTCC GTATTGACCT TCTTGGAATA 4440 

CTGAAAGTGG AAGTACTGCG TTTTCATCGT CAAGGATTGC TTTAGTGATA CGAGCAAGGG 4500 

CTACTGCGAT ACCGTAGTAT GTTGCACCTT TTTTGTTGAT GATTGTGTAG GCTGCATCAC 4560 

GAACACCTTC GAACAATTCA ATCAATTCAG CTTCTTGAAC ATTTTGAGTG TCTTTAAGGA 4 62 0 
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ATTCTTCAAG GTTTACACCA GCGATGTTAG CGTGTGACCA AACAGCGAAC TCAGAGTCAC 4680 

CGTGTTCACC CATGATGTAG GCGTGCACTG AACGAGCATC CACATCCAAT TTTTCAGCAA 4740 

GTGCTTGACG GAAACGAGCT GAGTCAAGTG AAGTACCTGA ACCGATAACG CGTTCTTTAG 4800 

GGAAACCAGA GAATTTCCAA GTTGAGTAAG TCAAAACGTC AACTGGGTTA GCAGCAACAA 48 6 0 

GGAAGATACC TTTGAAACCA GATTCAACAA CTTGAGTTAC GATTGATTTG TTGATAGCAA 4920 

GGTTTTTACC TACAAGGTCA AGACGAGTTT CACCTGGTTT TTGAGGTGCA CCTGCAGTGA 49 8 0 

TCACAACAAG GTCAGCGTCT GCACAGTCAG AGTATTGAGC TGCATAGATT TTTTTAGGTG 5040 

AAGTGAAGGC AAGGGCGTGA CTAAGGTCAA GCGCATCACC AACAGCTTTT TCATGCAATT 5100 

GTGGAATTTC GATAATTCCA AGCTCTTGTG CAATTCCTTG GTTAACAAGT GCAAAAGCGT 516 0 

AAGATGAACC TACAGCACCA TCACCGACAA GGATAACTTT TTTGTGTTGT TTAGTTGAAG 522 0 

TCATTGTTTT AAACATCTCC TTAATTTTAT TAGGGGATTT TCCCTAGACA ACTTCATT 527 8 
(2) INFORMATION FOR SEQ ID NO: 228: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1941 base pairs 

(B) TYPE : nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 228: 

ATAAGGAATC TCTAAAAAAT TTTAAGGAGA ATCTAGCAAA TGGATTTCAC ATGGGCACTG 60 

AAGTATGCCA CTGAATTTTT GGGAACTGCC ATTTTGATCA TTCTTGGGAA TGGTGCAGTT 12 0 

GCCAACGTTG AACTTAAAGG TACGAAAGGT CACCAAAGTG GCTGGATCGT CATCGCTGTT 180 

GGTTATGGTA TGGGGGTTAT GATCCCAGCC TTGATGTTTG GTAACGTATC TGGGAATCAC 240 

ATCAACCCTG CTTTCACTCT AGGGCTTGCA GTTAGCGGTC TTTTCCCTTG GGCACAAGTG 300 

GTACCTTACA TTATCGCGCA AGTCTTGGGG GCTATCTTTG GCCAAGCCTT AGTTGTGGCA 3 60 

ACATACCGTC CATTCTACTT GAAAACTGAA AACCCAAATA ACATCTTGGG AACTTTCTCA 420 

ACTATTTCAA GTATTGACCA TGGTACAAAA GAAAGTCGCT ATGCAGCAAC TGTCAATGGT 480 

TTGATTAATG AGTTTGTTGG TTCATTTGTT TTGTTCTTTG CAGCTCTTGG TTTGACTAAA 540 

AACTTCTTTG GTGCTGAAGT GCTTCAATTC ATGAAACAAA AGGCAACAGA AGCAGGACAA 600 

ACAGTTGATT TTTCTGACTT GGCTATTAAA GCACAGGTGG CTCCACACAC TGCTTCAGGA 660 

CTTTCTGTGG CTCACTTGGC ACTTGGATTC CTCGTTATGG CTTTGGTAAC ATCACTTGGA 720 

GGACCTACAG GACCTGCCTT GAACCCAGCC CGTGACTTGG GACCACGTCT CCTTCATGCT 780 
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TTCCTTCCCA AATCAGTTCT TGGTGAGCAT AAAGGCGATT CAAAATGGTG GTATTCTTGG 84 0 

GTACCAGTAG TAGCACCTAT CGCAGCAGCA ATTGCGGCAG TAGCTGTATT CAAATTCCTT 900 

TATCTCTAAG AAATAGCTCC TTTAACATTT GAGTGAGCAC CATCTATAAG TAAGAGAGGA 960 

TCAGACTGGk TCTCTCTTTT kGATTTTTaG GGAAATGAAA GAAcTCTAAA CAAACTCCTC 102 0 

TCCAGCAGTG GTTTAGAAGT CTCAGTGGGC TATTCCAGCT TCAATGGACT ATAGTAGGTT 108 0 

GCAGTTGAAA TAATAGACCC TTGTTTCTAA AACATTGTGA GAAATTGGTT TGAATTCTCC 114 0 

AATCAAATTG TGCAGTTTTC ATTCTACTAT ATATTATCGG AATATTATCG GAGATGGGTT 1200 

CCCTATCTTG TAAGTCTGCT TTATAGTGGG TTGAAGTTGG AATAGTCCTC CCTTCTTTCT 12 6 0 

CAAACATTGT GAGGAATTGA TTTACCTTCC TCAACAAAAT GTTCAGTTTC TATTTCATTT 132 0 

TACTATAAAA TAAGCGATTA GGGGGGCTAT TCTTCGACCT ACATTGACTC TGCTGAGTCC 13 80 

TATGATTGTT ATCGTTTTAT CTGCAATTTT ATACTCAATG AAAATCAAAG GGCAAACTAA 1440 

GAAGCTAGCC GCAGGTTGTT CAAAACACAG TTTTGAGGTT GTATAGTAGA TTGAAACTAG 1500 

AATAGTACAC ATCTACTTCT AAAACATTGT TAGAAATCGA TTTGACTGTC CTGAACGATT 1560 

TGCCCTATTC TTGTTTCATT TTACTATATA AACCAGAGAC TGTTTACATT TTCAGCAAGT 162 0 

GAGTGGATGG ATAATGCTGA AAACTCCTTG AAGGATAAGT CTATTTAGTA CTTTCTATTA 1680 

ATTAGTTAAA TTTTTACCAA GAATAATTCA CAAAAACGTT GTAAAACACT TGCAATTTAG 174 0 

CTGAAATTTG ATAAAATAGT AAGGAAAGTT AGACTGTATT GCCTACTGTC TATCTATAAA 1800 

ATATATTTTA TTGGAGGCTT TTACTCAAAT GGCAAAAGAA AAATACGATC GTAGTAAACC 1860 

ACACGTTAAC ATTGGTACTA TCGGACACGT TGACCACGGT AAAACTACCC TAACTGCAGC 1920 

TATCACAACT GTTTTGGCAC G 1941 
(2) INFORMATION FOR SEQ ID NO: 2 29: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 755 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 229: 

ATTTGAAGAA ATTGAAGAAA TCGTAGCCCC TACAGATGGT GAATTTTTGG GGGAAGTTTT 60 

ACTTGGAACT GGGGTAGTTC TCTTAATTGG AGTAGCCTGT TGTTAAAAAG ATAGGGAGTG 120 

ATAATCATGC AAGATAACTT TTTATTTGAG GAAATTGAAG AAATTTCAGT ACCAGTTAAT 180 
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GATTTTTCAG CTGGACTTGC AACAGGTATC GGATTTGGTT TAGCAATCCT TGCTCTTGCT 240 

GGTTGTTGAA GTTTGTTCAT TTACTAACAT CAAGCTTTTT CAATTTCATT TTAGACAGTC 3 00 

ATTTAAATTT TCCGTATTAG TCTTGCAGCA AGAGATTAAT AGAATTAGTC ATTATTTTAT 3 60 

TGATTGCGGA CTGAGGGACT AGAGTATGTT TTACTTAACC CCTCTTTTAT TTATTAAAGG 42 0 

TTAGGTTTGT TATGAGAATT GTTGATAAGA TTAAGATATT ACCTACTCCT TATGAGGGAC 480 

ACTATCATTT ATATATACCA TCCAGTAAGA AACATGTATT AGTTGGGAAA CAGGAAAAAA 54 0 

ATGGTTAGAG CAACTAATAG GTCAAGAATT TACCATATCG GACTTATTAG TGTTAGTAGG 600 

GAAGAAATAT TTTTAAAATA TCTTGGGACT TTAATATAAC ATTATCTGAA AAATTAAACT 660 

ATAAAAGATT TAATAAGAAT TTTGAAAAAA TCCTATCTTG TTGTCATTAT ATTTGCAACG 72 0 

ATACATGAAA TTAGTCATGC AATAATTGCT AATAA 755 
(2) INFORMATION FOR SEQ ID NO : 23 0: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1483 base pairs 

(B) TYPE : nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 230: 

CCAGAAAAAC CGTAGTGGAG CTCGTGGAAC AGTGGAATTG ATTTTCCAAA AAGAATACAA 6 0 

TAAATTTTCA AGTATCTCAA AGAGGGAGGC ATAAGATGTC AGATGCATTT ACAGATGTAG 120 

CCAAGATGAA AAAAATCAAA GAAGAAATCA AGGCACATGA GGGACAAGTC GTAGAAATGA 180 

CTTTGGAGAA TGGTCGTAAG CGCCAAAAAA ATAGATTGGG TAAGCTAATT GAAGTTTATC 240 

CATCTCTATT TATTGTGGAG TTTGGGGATG TGGAAGGAGA TAAACAAGTT AATGTTTACG 300 

TTGAATCCTT TACTTACTCA GATATTCTTA CAGAAAAGAA TTTGATTCAT TATCTTGACT 3 60 

AAAGTGAGAA ATTTTCTCAC TTTTTCTTTT TTCTCCGAAT AATTTAGGTG AAGGCAATCA 42 0 

TCGCTTTATA TTATTTTTCA AGGAGGAAGA ATGAAAATTT TACCGTTTAT AGCAAGAGGA 480 

ACAAGTTATT ACTTGAAGAT GTCAGTTAAA AAGCTTGTTC CTTTTTTAGT AGTAGGATTG 54 0 

ATGCTAGCAG CTGGTGATAG TGTCTATGCC TATTCCAGAG GAAATGGATC GATTGCGCGT 600 

GGGGATGATT ATCCTGCTTA TTATAAAAAT GGGAGCCAGG AGATTGATCA GTGGCGCATG 660 

TATTCTCGTC AGTGTACTTC TTTTGTAGCC TTTCGTTTGA GTAATGTCAA TGGTTTTGAA 72 0 

ATTCCGGCAG CTTATGGAAA TGCGAATGAA TGGGGACATC GTGCTCGTCG GGAAGGTTAT 78 0 

CGTGTAGATA AT AC AC CG AC GATTGGTTCC ATTACTTGGT CTACTGCAGG AACTTATGGT 84 0 
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CATGTTGCCT GGGTGTCAAA TGTAATGGGA GATCAGATTG AGATTGAGGA ATATAACTAT 
GGTTATACAG AATCCTATAA TAAACGAGTT ATAAAAGCAA ACACGATGAC AGGATTTATT 
CATTTTAAAG ATTTGGATGG TGGCAGTGTT GGGAATAGTC AATCCTCAAC TTCAACAGGC 
GGAACTCATT ATTTTAAGAC CAAGTCTGCT ATTAAAACTG AACCTCTAGC TAGCGGAACT 
GTGATTGATT ACTATTATCC TGGGGAGAAG GTTCATTATG ATCAGATACT T G AAAAAG AC 
GGCTATAAGT GGTTGAGTTA TACTGCCTAT AATGGAAGCT ATCGTTATGT TCAATTGGAG 
GCTGTGAATA AAAATCCTCT AGGTAAtTCT GTTCTTTCTT CAACAGGTGG AACTCATTAT 
TTTAAGACCA AGTCTGCTAT CAAAACTGAA CCCCTAGTTA GTGCAACTGT GATTGATTAC 
TATTATCCTG GAGAGAAGGT TCATTATGAT CAAATTCTCG AAAAAGACGG CTACAAGTGG 
TTGAGTTATA CGGCTTATAA CGGAAGTCGT CGCTATATAC AGCTAGAGGG AGTGACTTCT 
TCACAAAATT ATCAGAATCA ATCAGGAAAC ATCTCTAGCT ATG 
(2) INFORMATION FOR SEQ ID NO : 231: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1027 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 231: 

CCCGGAAAAC AAGTTAAAGT TGAAGTTGGT CAAGCAGTTT ACGTTGAAAA ATTGAACGTT 60 

GAAGCTGGTC AAGAAGTTAC TTTTAACGAA TTGTTCTTGT TGGTGGTGAA AACACTGTTG 12 0 

TCGGAACTCC ACTTGTTGCT GGAGCTACTG TAGTTGGAAC TGTTGAAAAA CAAGGAAAAC 180 

AAAAGAAAGT GGTTACTTAC AAGTACAAAC CTAAAAAAGG TAGCCACCGT AAACAAGGTC 24 0 

ACCGTCAACC ATATACAAAA GTTGTCATCA ACGCAATCAA CGCTTAATTT TAAGGAGAAC 3 00 

AC AT GAT AC A GGCAGTCTTT GAGAGAGCCG AAGATGGCGA GCTGAGGAGT GCGGAAATTA 3 60 

CTGGACACGC CGAGAGTGGC GAATACGGCT TAGATGTCGT GTGTGCATCG GTTTCTACGC 42 0 

TTGCCATTAA CTTTATCAAT TCTATTGAGA AATTTGCAGG CTATGAACCA ATCCTAGAAT 480 

TAAACGAAGA TGAAGGTGGC TATCTGATGG TTGAAATACC AAAAGATCTT CCTTCACACC 54 0 

AGAGAGAAAT GACCCAGTTA TTCTTTGAAT CATTTTTCTT AGGTATGGCA AACTTATCGG 600 

AGAACTATTC TGAGTTCGTC CAAACCAGAG TTATCACAGA AAACTAACAC GGAGGAAAAC 660 

ATTATGTTAA AAATGACTCT TAACAACTTG CAACTTTTCG CCCACAAAAA AGGTGGAGGT 72 0 
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TCTACATCAA ACGGACGTGA TTCACAAGCA AAACGTCTTG GAGCTAAAGC AGCTGACGGA 7 80 

CAAACTGTAA CAGGTGGATC AATCCTTTAC CGTCAACGTG GTACACACAT CTATCCAGGT 84 0 

GTAAACGTTG GTCGTGGTGG AGATGATACT TTGTTCGCTA AAGTTGAAGG CGTAGTACGC 900 

TTTGAACGTA AAGGACGCGA TAAAAAACAA GTGTCTGTTT ACCCAATCGC TAAATAAAAA 960 

GGTCCATTGA ACCTTTTATC CCGAACCTTG AAATGTAGAG GTGAGGAAGC TAGAAACAGC 102 0 

TTAAAAT 1027 
(2) INFORMATION FOR SEQ ID NO: 232: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1990 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 232: 

CGGTTCAAAT GGTGCAGGTA AATCTACGTT AATTAATTCT ATTGTAGGTT TTCAAGAGAT 60 

TTATTTAGGA GAAATAGAGT ATTGTGATAA AGATTTGATA GTTAGTTCTC AACCTTTTGC 120 

TCATTTAGGC TTTACTCCTC AAACCACAGT AATTGATTTT TATACTACTG TGAAGGACAA 180 

TGTAATATTG GGGCTGAACC TTGCTGGAAA GTTTGGGAAA AATGCTGAGA AGTTGTGTCA 240 

AATAGCCTTA GAAATTGTTG GGTTAGCTGA TAAAAAAAAT AATTTGGTAG AAACATTGTC 300 

AGGTGGACAA CTGCAACGCG TCCAGATTGC TAGAGCAATA GCTCATAATC CAGATTTTTA 360 

TATTTTAGAT GAACCTACCG TTGGTTTAGA TACTGAATCT GCCGAAAAAT TTTTAATGTA 42 0 

TTTAAAAGAT AAGAGTTTGG AAGGAAAAAC TATTATCATA TCTTCACATG ACATAAATCT 480 

ACTCGAAAAG TTTTGTAAAA AAATACTTTT TTTACAAAAT GGCTCCATAT CATTTTTTGG 54 0 

TGATATGCGT GACTTTGTAG ATAATTCAAC TATCAAATTA AATTTTTCAA TGCAGAATAG 600 

AATTTCTAGA TATCAAATTG AATTTTTAGA AAATTTTAGA TTTAAAGTTC ACATCGAAGA 660 

TAATGATAGT TTTACAATAG AAGTCCCTAT AGAAGAAAAG ATCTTAGATG TTATCAATGA 72 0 

GGTAGGAAAA GCATGTGAAA TTAAAAACTT TTCAACAAGT AAATTAACCT TACAAGAAAG 780 

TTATTTGCAA AGAATAGGAG GAGAAAAATG AAGGCTGATC AATTAAGGCA CAAATCGGAC 84 0 

TTAGGTTTAA GAGGTCTAGC GATTATTGCT AAAAATGAGA TTATTGCTTT TTTTAGAAGT 900 

AAAGGTTTAA TTATTTCTCA GTTTCTACAA CCAATCTTAT ATGTTGTTTT TATAATAATA 960 

GGATTAAATT CTTCGATAAA GAACATTCAG TTTAATGATA TAAAAACCTC TTATGCAGAA 1020 

TATACAATCA TTGGTGTTAT AGCTTTATTG ATAATCGGGC AGATGACTCA AGTTATTTAT 1080 
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AGGGTGACAA TAGATAAAAA ATATGGGCTA CTTGCTCTTA AGTTATGCAG TGGAGTTCGT 
CCTTTATATT ATATTTTAGG GATGAGTATC TATTCTATAT TAGGGTTGAT AGTTCAAGAA 
ATTATTATAT ATATAATTAC GTTAGCGTTT GAGATAAATA TCGCAATGGA TAGATTTTTT 
TATACAGTTT TGTTATCTAT TGTTGTTTTA TTATTTTGGG ACTCCCTTGC AATTTTACTT 
ACAATGTTTA TCAATGATTA CAGAAGACGT GATATTGTAA TACGTTTTGT ACTAACACCG 
CTTGGTTTTA CAGCTCCTGT TTTCTACTTA ATAGATTCTG CTCCTAGTAT TGTGAGATGG 
ATTGGTCAGT TAAATCCCTT AACTTATCAA TTAACTATTT TGAGAAACTT TTATTTTAAA 
AATTCAACAA CTTTGGAATT AGTTTTCTTA TTGTTAACAT CATTACTTGT CCTTATATCT 
GTATCTTTTA TTATACCAAA GATAAAATTG ATACTGATAG AAAGATAAAA GTTGGGTCAT 
CCAACTTTTT TGTTGTCTCC CGAAAACCAC TAGCTATGCT AGTGGTTCCA TAGAGCTTTT 
AGCGTGGTAA CAAAAAGAAC CTCCTAAAAT GATAAGATAG AAGTGGTTTC TCCGCCACTA 
CAACATATCA TACAGGAGGT ACCTCATGAG AGAGGATAAT CAAAGTTTAT CACATACCAC 
ATGGAATTGT AAATATCATA TTGTTTTTGC ACCCAAATAT CGTCGTCAAA TCATTTATGG 
CAGATACAAA GCTAGTATCG GAAGAATCAT ACGTGACTTA TGTGAGCGTA AGGGTGTAAT 
AATCCATGAA GCGAATGCTT GTTCAGACCA TATTCACATG CTTATCAGTA TTCCTCCGAA 



INFORMATION FOR SEQ I 



SEQUENCE C 

(A) LENGTH: 4766 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 23 3: 

GAACTATATT GCATATATTT CTAGCAATGA TCATGGCGAA TCTTGGTCTG CACCAACTTT 60 

ATTACCTCCT ATAATGGGAC TTAATCGGAA TGCGCCATAT TTAGGTCCTG GACGTGGAAT 12 0 

CATTGAAAGC TCAACTGGAC GTATTCTTAT TCCGTCTTAC ACTGGTAAAG AGTCTGCGTT 180 

CATTTATAGT GACGATAATG GAGCATCTTG GAAAGTTAAA GTAGTGCCAC TTCCTTCTAG 24 0 

TTGGTCAGCA GAAGCACAAT TTGTAGAATT GAGTCCAGGA GTAATTCAAG CATATATGCG 300 

TACAAATAAT GGTAAAATTG CATATTTAAC AAGTAAAGAC GCAGGTACTA CTTGGAGTGC 360 

ACCGGAATAT TTGAAATTTG TTTCAAATCC AAGTTATGGA ACACAATTAT CAATCATCAA 42 0 
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TTATAGCCAA TTGATTGATG GTAAAAAGGC TGTCATTTTA 
TGGTCGTAAA CACGGACAAA TTTGGATTGG TCTAATTAAT 
GCGTTATCAT CACGACGTTG ATTATAGTAA CTATGGATAC 
GTTACCAAAT CATGAAATTG GATTGATGTT TGAAAAATTT 
ACTTCATATG AAAAATGTTG TACCATATAT AACATTTAAG 
TTAAAGCTGA AATTTGAAAA TATATAAAAA GAGGATAAAA 
TGTTGGAGCT GGATATTTTG GAGCTGATTT AGCTCGCTCA 
AAAAGTGGTT GCGGTATTTG ACCCAAATCA TGGAGAAGAA 
AGATGTTTGT GCAAGTTTAG ATGAACTTGT AGCACGTGAA 
AGCTTCACCT AGCTACCTTC ACCGTGAACC AGTTGTGAAA 
CGTATTTTGT GAAAAGCCAA TTGCATTGTC TTATGAAGAT 
ATGTAAAGAA AATAATGTCA TCTTTATGGC TGGTCACATC 
ACACCATGCT AAAGAATTGA TTACTCAAGG TAAAATCGGT 
TGCTCGTACA GGTTGGGAAG AACAACAACC AACTGTATCA 
ATCTGGAGGA CATTTGTACC ACCATATTCA TGAATTAGAT 
AGGACTTCCT GAAAAAGCGA CAATGGTAGG AGGCAATGTA 
TGGTGATGAA GATGATATGC TCATTGTAAA CTTAGAATAC 
TTTGGAATAT GGTAATGCTT TCCGTTGGGG TGAACACTAC 
AGGAGCTATC AAACTTGACT TGTTCAATAC TGGCGGTACT 
AGAATCACAC TTCTTAGTTC ATGAAACTCA AGAGGAAGAT 
TACCGGTCGT GGTATGGATG GAGCAATTGC GTACGGTAAA 
ATGGTTGCAA ACATGTATTG ATAAAGAAAT GGAATATCTA 
AGAAATTACA GAAGAATTTG AAAAACTTCT CAATGGTGTA 
TACCGCTGAT GCATGTACTT TATCAGTTAA AGAAGATCGA 
CACAAATGCT TAACTTTTGT AAAACAGAAT AGTAAATTCT 
AGTTCTGTGA TACAACTCAT TGAATAAAGA AATAGAGATG 
CCCATTTTTT ATCAAAAAGT AATGAGATCA AAAATGTGGG 
AGGTATCGAT ATTGGCGGAA CAACAATTAA GGCAGATTTA 
TTTGAATCAT TTCAAAGAGA TAGAAACAAT TATTGACTAT 
ATTAAATCAG GTCTGTGATT TAATTGGTGA GTATACTTTA 



AGTACTCCAA ACTCCACAAA 4 80 

GATGATAATA CAATTGATTG 540 

TCATATTCAA CATTGACAGA 600 

GATTCATGGT CTCGTAATGA 6 60 

ATTGAAGATC TGAAAAAGAA 72 0 

ATTATGGTAA ATTACGGTAT 7 80 

ATGAACAAAA TTGAAGATGC 840 

GTTGCTCAAG AGTTGGGATC 900 

GATATTGATT GTGTGATCGT 9 60 

GCTGCTCAAC ATGGCAAACA 1020 

TGTAAAGCCA TGGTTGACGC 108 0 

ATGAACTTCT TTAACGGTGT 1140 

AAAGTTCTTT ATTGCCATGC 12 00 

TGGAAGAAAC TTCGTTCTCA 12 60 

TGCATTCAGT TTATCATGGG 13 2 0 

TATCATAAAG GTGAAAACTT 13 80 

TCTGATGATC GTTATGCTGT 1440 

GTCTTGATTC AAGGAACTGA 1500 

CTTCGTGTTA AAGGTGAAGG 15 60 

GATGATCGTA CAGCTATCTA 162 0 

CCAGGAGTAC GTTGCCCATT 168 0 

CATGACATCA TTAAAGGTGG 1740 

GCTGCTTTAG AATCAATCGC 1800 

AAAGTAAGTC TTTCAGAAAT 18 60 

TGTCATTATA TAATTTCTAA 1920 

GGACTGGGAT AATGCCCAGT 19 8 0 

AGTGTTGAAA TGAAGATTAT 204 0 

TACGATGAGT TTGGAACGAG 2100 

GATTTGGGAA CGAATCAGAT 2160 

AATCATTCAA TTGATGGTGT 222 0 
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TGGGATTTCC ACTGCTGGAG TTGTTAATGC TAATACTGGA GAAATCATCT ATGCAGGCTA 22 80 

TACAATACCA GGGTATATCG GAGTAAACTT TACTGCCGAA ATAGAAAAAC GTTTTGGGTT 2340 

GTATACTTTT GTTGAAAATG ATGTTAATTG TGCTGCATTA GGTGAATTGT GGAAGGGACA 2400 

AGCCAAAGAT AAGAAAAATG TAGTAATGGT TACTATTGGA ACAGGTATAG GAGGCAGTAT 246 0 

TATTGTCAAC GGACAAATTG TTAACGGATT TAACTATACT GCTGGTGAAG TAGGTTATAT 252 0 

TCCTGTAGGT AATTCGGATT GGCAAAGTAA AGCCTCAACA ACCGCATTGA TTCATTTATA 25 80 

TCAAAAAAAG AGCTTGAAAA CTAATCAAAC TGGACGTACT TTCTTCACTG ATTTAAGATC 2640 

TGGAGATAAA GTTGCTGAAG AAACTTTTGA AATTTTTGTA GAAAATCTAA CAAAAGGTTT 2700 

ATTAACGATT TCTTATCTAC TTAATCCAGA AATTCTCATA TTAGGAGGTG GGATTCTGGA 2760 

TAGTAAGGAT ATTTTGTTAC CTGAAATTCA AAGTTCTTTA GCTAAAAATG CAATGGATAA 2 82 0 

TAGGTTTTTA CCTAAAAATC TTGTGGCAGC TACATTAGGA AATGAAGCTG GTCGTATAGG 2 880 

AGCTGTAAAA AATTTCTTAG ATAGAATTTC TAATAAATAG TATGTAAGAT AAGGAGGTGT 2940 

CACAATGACT AACTCTGTAT TTTCGACAAT GCAAGATATT GAGAATGTTG CAACCGATAT 300 0 

TATAAAATCA TATGATAATG AGATTTATAC TTATAAAGCT GTTTCCCAAG AAGAATTGGA 3060 

AAAACTAGAA AAAAGTTATG ATGAAAAAAG TCACGAAGAA TTAGTTTCAA TAGAAAGCAA 3120 

TTTAGAAATG AAACAACAGA ACCTTATTGA TGAGGTTAAT AAAACAATCA AGGAAAATGA 3180 

TGCAAATATT CAGTATATTT CATCAAGTAG GAGAGGAGAA TTTGTAGAAA AAATTATTGG 3240 

TAGGGTGGTA GAAAAATATG GCCATTAGTC AGATGAAAAG AATCTCTCTA CTATTTTCTA 330 0 

AAAGTAGTCT TGATGATGTT TTAAAAACTA TTCAAGAACT AGAGTCAGTG CAGTTCCGTG 3360 

ATTTAAAGGT TCAGGATAAC TGGTCAGAAG CTCTAGAAAA AGATGAAGTT GTATTTCCAA 3420 

CTATTCAAAT TTTTCATACT TCTAATTCCA ATCATGGGGT TATTGAGGGA AATGATGCCT 3 480 

TGACTTATTT GATGAATCAA CAACAACATT TAGAAGCAAC TGTAGAGAAA TTACAAGAAT 3 540 

ACCTACCGAA AGAAAACACG TTTAAATTAT TGCAGCAACC TCCGATAACT ACCTCTTATG 3 600 

AAGAATTAGA GAAATTTGGT AAAGCTAATG TTGCTGAGGG TGTTCTTAAA AAAGTGAATC 3660 

ATCAAATTAA CAGAGTTCAT GAATTAGAAA GACACATTCA AAGTAATAAT GAGGAAATAG 372 0 

AGCGATTAAT AAAGTGGGAA AAATTAGAAA TTGTTCCTGC GAATTTAGAA CAATTTTCTT 3 780 

TCTGTAAAGG AAAAGTCGGA ACAATTCCAA GGACTGAAGA TAATCGCTTA TACAATAGTC 3 840 

TTTTAGAAAA CAATATTGAA GTTCAAGAAA TATTTTCTAA TGATAGAGAG TACGGTGTTG 3 900 

TTGTTTTCTA TCAGTCTAGT TACTCTATAG ATTTTGATGA ATACTTATTT GAACCATTTG 3 960 
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ATTATTCTAG AAAGGAATTA CCGAAGCAGC GAGTAGTAGA TTTAGATCAA GAAAACATGC 4020 

AGTTAATAAC TGAAAAAGAG AATATTATCG CATCGTTGCA AGATTCAAAG AAATATTTGA 4 080 

TAGATTTACA ATGGCAAATA GACTATATTT TATCTATCTA TGCTCGTCAA ATCTCTAAGA 4140 

ATAACTTTTT GTGCACTCCG CATCTAGTTG CATTAGAAGG ATGGATAGAA GAAACTCGTA 42 00 

TTTTATATTT TATAAAAGTT ATGGATGAGC ATTTTGGACA TTCTATTTAT ATTTATGAAT 42 60 

CGGAAACATT GACGGATAAT CAAGATGAAA TACCTATCAA ATTAACGAAT CATTCTTTAA 4320 

TTGAACCATT TGAATTATTG ACAGAAATGT ATGCTCTGCC CAAATATTAT GAGAAAGATC 43 80 

CTACACCTGT ATTAGCACCA TTTTACTTTA CATTTTTTGG AATGATGGTT GCTGATTTAG 4440 

GCTATGGTTT ACTATTGTTT TTAGGAACAA TGTTAGCATT AAAAATTTTT CATCTACCTT 4 5 00 

CAGCAACTAA GAGATTTTTA AAATTCTTTA ATATATTAGG GGTAGCCGTT GCAATTTGGG 45 60 

GTGGAATCTA TGGCTCATTT TTTGGATATG AGTTGCCATT TCATCTGATA TCTACAACCT 4 62 0 

CTGATGTCAT GACTATATTA GTAGTGTCAG TTGTGTTTGG GTTTATTACA GTATTTGCAG 46 80 

GTTTGTTAGC TTCAGGACTA CAAAAAGTAA GAATGAATAA ATATGCAGAA GCATATAATT 47 4 0 

CAGGATTTGC GTGGTGTGTT ATTCTG 47 6 5 
(2) INFORMATION FOR SEQ ID NO: 234: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 2484 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY : linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 234: 
CCTTTTAGAA AAAATTAAAG AATACGACAC CATTATCATT CATCGTCATA TGAAACCAGA 60 

CCCTGATGCC TTGGGAAGTC AGGTGGGATT GAAAGCCTTG CTGGAACATC ATTTCCCAGA 12 0 

AAAAAC C AT C AAAGCCGTCG GTTTTGATGA ACCAACTCTT ACTTGGATGG CTGAGATGGA 180 

TCTTGTTGAA GATAGAGCCT ACCAAGGCGC ACTTGTCATC GTCTGTGATA CAGCTAATAC 240 

TGCTCGTATC GATGATAAGC GCTATAGTCA AGGTGATTTT CTCATTAAGA TTGACCACCA 3 00 

TCCAAATGAT GATGTATACG GTGACCTGTC TTGGGTCGAT ACTAGTTCAA GTAGCGCTAg 360 

aGaTGATTAC CCTATTTGCC CAAACAACCC AACTAGCCTT GGCAGATCGC GATGCTGAGT 42 0 

TGCTCTTTGC AGGAATTGTC GGTGATACAG GTCGCTTCCT CTACCCTTCT ACCACTGCAC 480 

GGACTCTTCG CCTGGCTGCT TATTTGAGAG AACATAACTT TGACTTTGCG GCTCTCACTC 54 0 

GCAAAATGGA CACTATGAGC TACAAAATTG CTAAACTGCA AGGCTACATC TACGACCATC 600 
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TGGAAGTGGA TGAAAATGGT GCTGCTCGCG TTATCCTGAG TCAGAAAATC TTGAAACAAT 660 
ACAATATAAC CGATGCTGAA ACTGCGGCCA TTGTAGGTGC ACCTGGACGC ATTGACAGAG 72 0 

TGAGTCTCTG GGGAATTTTT GTCGAACAGG CTGATGGCCA CTACCGAGTT CGCTTACGCA 780 
GTAAAGTCCA TCCTATCAAT GAAATTGCCA AGGAGCATGA TGGTGGAGGC CACCCTCTAG 840 
CAAGTGGTGC TAATTCCTAT AGCCTAGAAG AAAACGAAAT CATCTACCAA AAGTTAGAAG 900 
ACTTGCTTAA AAACTGATAA AATACTTGCC AAACTTTTCA GAATCTGATA GACTAGTATA 960 

GTAACAATCT ATGGCTCGCA AAGAGACCAT GGCAGAAAGG AAATATTGCA AAATGAAAAr 1020 

AGATATCCAT CCAGAATATC GCCCAGTTGT CTTCATGGAC ACAACTACTG GTTACCArTT 1080 

CCTTAGCGGT TCAACAAAAC GCTCTAACGA AACAGTTGAG TTCGAAGGCG AAACTTACCC 1140 

ATTGATCCGT GTGGAAATTT CATCAGACTC ACACCCATTC TACACTGGAC GTCAAAAGTT 12 00 

CACTCAAGCA GATGGACGCG TGGATCGTTT CAACAAAAAA TACGGTCTCA AATAATGATA 12 60 

AGAGAACAGT TTTGGCTGTT CTTTTTTGTT TCTTGAAATC AACTGCTGTT TTCATGTTCC 13 20 

AGACTCATCT GTAGGTTCGA TTTCCATGCT ACTAGGCAGG AAGGAAATAG CTGTTTCAAC 13 80 

ACGTCCATAA TGAGCTATAC TATTGTCACG AACCACACTT TCATTGATGG TCCAAGTGGA 1440 

ATTCATTTTC TTAAAAGCTT CTCGGACTTT TTCCAAATCT TTGGAGGCAA TGGCCTGCTC 1500 

TAAGGTTTCA AAACGAGGAC TTATACTCAT CTGCTTTCAA AAAGCATTCT AGTCCATCTC 15 60 

CGATTACCGA TGGACTTTAT CACCTCCTTC TCCAGTCCTT GTATGACATC TTGAAGTTGA 162 0 

TTCATGACAT CTTCCAAAGT TCgAAAGGCT TTATTCTTAA ATCCACGTTT ACGAATCTCT 1680 

TTCCACACTT GTTCAATGGG TTCATCTCTG GTGTGTATGG AGGAATAAAG GTAAAATCAA 174 0 

TATTAGTCGG AATATTTAAG GTACTTGATT TATGCCATAT AGCATTGTCC ATAACGAGTA 18 00 

AAAGGATAAG CTTGTGAAAG CTCTTCTAAA AAGGCGTTCA TCCACACTCC TTTTTATAAA 1860 

CCTGAAATAA GGCATCAATT GTAACAAATT CTCCTGCCTC TGTAGCCTTC AAATGACGGG 192 0 

CAAGAAAGGC TTTCTCTTCC TCAACTGTCA TATATGCATG GTTACGACCA CCACGTGTTT 1980 

CTTGAAGGAG AGAGTCGAGT CCGAACTCCT CATATTTTTT TACGTTTCGC CAAATCGTTG 204 0 

'JTTGATTACA GTCTAAAAGC TCTATAATCT CTTTATAAGA TTTGCCCATC ACACCAAATA 210 0 

TAGTAGATTG AAACTAGAAT AGTACACCTC TACTTCTAAA ACATTGTTAG AAATCGATTT 2160 

GTCCTGTTCT TGTTTCATTT TACTATAGAA CGATTTGAAG GCGTTTATAA TATTTAGCTG 22 2 0 

TACGAGAGTC TTTTAAAAGT GTTTTGATGG TTTGGATTTC TTCTTTAGTT GATTTCATAT 22 8 0 

TACTATTATA TAATGCTTTT TGATTTTAGT CTGGTATAAA TATTGCTTTC CTCCAAAATG 23 4 0 
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GTCATAGTTT TACTGGCAAA TCTAACATAT CACGGATAAA TTAACAAGTG ATTTCTGAAT 
TGCTAAACAT TTTCTTTTCT TATAGCATAC TTTAAGATTT TGTCTTTGAG AAAGATATTT 
CCAAGAAAAA CGTTCGTTTT TTGG 
(2) INFORMATION FOR SEQ ID NO: 23 5: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1766 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 235: 

CTAGATATAG CTATAATTTT ATTTATAACA AGAGGATAGA AATGACCGAA TTAGAAAGAA 60 

AAAATCGAAA AATTAGCTAA GAAATATTCT GATAACTTAA ACATCAAAGT TCAAGAGAGA 12 0 

GTTCGTGAAA TGGCAAATGA TAATAAGAGC CATTATTTGA TATACAGAGT TTTAGGTATT 180 

TCATTTGAAG AAGGAGAAAA TATCGATTTG TATCAAAATA AAGGTCGTTT TTTATACAAA 240 

TATGCTGGTT CATTTTTAGA AGAAGCTGCA GTACTATGCT TTAACGAAAA ATTTGGTACA 300 

GAAAATACTT AAAAAGTTAA CATTCCTAAT TCTGAAAGTA CAAAACCTAA GACTTTTGAA 3 60 

ATTGATTGTT TAGTCGGAGA AAAACACGCA TACGAAATAA AATGGTGGGA TGCAACTACA 42 0 

GATGGAGACC ATATAACTAA AGAACACACT AGAATAAAAG TTATTCATAA CAAAGGATAT 48 0 

ATACCAATTC GGTTAATGTT CTACTATCCA AATAGAACTC AAGCTATAAA AATTCAGCAA 540 

ACTTTAGAAA CATTGTATAA CGGTATTGGA GGGAAATATT ATTATGGAGA TTCTGCCTGG 600 

GAACATTTAA GAGCAGTGAC CGGTATTGAT TTACTTAGTA TTCTAACAGA TATTGCAAAT 660 

AAAAAAACAG GGGTAAAATC AAAATGACAG TATTAAAAGG AGATAACTTA GAAATATTAA 72 0 

AAACTATTGA ATCCTCAAGT ATTGATTTAA TCTATATGGA CCCTCCTTTC TTTACACAGA 78 0 

AAACCCAAAA ATTATCTAAT AACAAAAATA TTATGTATTC ATTCGAAGAT ACGTGGAC1T 84 0 

CGATTGAGGA TTACAAAGAA TTTTTGTCTG TAAGATTAGA AGAATGCAAA AGAGTGCTAA 900 

AAAATAGTGG CAGTATTTTC GTTCATTGTG ATAAAATTGC AAATCATCAT ATTAGATTAA 960 

TTTTAGATAA TATCTTTGGA GTAGATATGT TTCAAAGCGA AATTATATGG AACTATAAAC 102 0 

GGTGGTCTAA TTCAAAAAAG GGATTATTGA ACAATCATCA AAACATTTAC TTTTATTCAA 1080 

AGTCAAAAGA TTTTAAATTT AATACAATTT TTACAGAGTA TTCTTCTACT ACAAATATCG 1140 

ACCAAATACT AGTGGAACGA AAACGAGATG GAAACTCTAA AACTATATAT AAGGTTGATA 1200 

ATAATGGTAA CTATATTCTA GCAAAAGAGA AAAATGGAGT TCCCCTTTCA GATGTTTGGA 1260 
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ATATACCATT TCTTAATCCA AAAGCTAAAG AAAGAGTAGG TTATCCTACA CAAAAACCTA 13 2 0 

TTCTGTTATT AGAACAAATT ATAAAGATTG CTACTGATAA AAATGATATA GTTTTAGACC 13 80 

CGTTCTGTGG AAGTGGAACT ACTTTAGTAG CCTCCAAGAT TTTGAATAGA AATTATATGG 144 0 

GGATTGATTT ATCTGAGGAA GCTATCAATA TAACTCAGCA ACGTCTGGAA AATGTTATAA 15 00 

AAACAAGTTC AAATTTATTG AATAAAGGAA TCGAAGCATA TAGAAC CAAA ACTGAGGAAG 1560 

AGGAAAACAT TCTTAAATTA TTACAGGCAA AAATTGTTCA AAGAAATAAA GGAATTGATG 162 0 

GTTTTTTACC TAAACATTTT CAAAAAAAAC CGATACCTAT AAAAATTCAA AAAAATAATG 168 0 

AATGTCTGAA TGAGAGTATC TCTTTATTAC AGAATGCTAT AAACTCCAAA AAACTTGATT 1740 

TTGGAGTAGT TATAAAAACT CATTCG 17 6 6 
(2) INFORMATION FOR SEQ ID NO: 236: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 748 base pairs 

(B) TYPE : nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 236: 

CCGAAAATCA AATTCAAACC ACGTCAACGT CGCCTTGCCG TACTCAAGTA CAGCCTGCGG 60 

CTAGTTTCCT AGTTTGCTCT TTGATTTTCA TTGAGTATTA AACTAAATTA AATAATATTA 120 

GCGCGGAGAA TTTCTAATTC TTCCTTGGTC AAGCGACGCC ATTCCCCTCG TTCTAGGTTC 180 

TCATCTAATA CTAAAGTTCC CATAGTCAAT CGTTGCAAGT CCACCACTTC CTTGCCACAG 240 

TAGCCCACCA TACGCTTGAT CTGATGAAAC TTCCCTTCTG CAATGGTCAC ACGGATTTGG 300 

CTTTGATTCT TTTCTGTATC TATGGATACA AGCTCCAGTA TAGCGGGTTG ACAGGTAAAG 3 60 

TCTTTGAGAG GAATACCCTC AGCAAATGTC TCCACATCTT CTTGGGTCAT GATTCCCTTC 420 

ACTTGTGCCA GATAAGTCTT GTCCACATGA CGCTTGGGCG AAAGAAGAAC ATGAGCCAGC 480 

TGACCATCAT TGGTCAAGAG CAAAAGACCA TGCGTGTCAA TATCCAAGCG TCCTACTGGG 540 

AAAACTTCCT TACTCCGCGC CAAGTCATCC AACAAGTCCA GAACGGTTCT GTGCTTGGGA 600 

TCCTCAGTCG CTGAGATAAC TCCTTTGGGC TTGTTCATCA TGTAGTAGAC AAACTCTTCA 660 

TACTCCAACA CTTGCCCATC AAAGCGAATC TCATCTATTT TTTCATCAAT CTGCAATTTA 720 

GCTGATTTTT CTTTTTGACC ATTTACAG 748 
(2) INFORMATION FOR SEQ ID NO: 2 37: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1449 base pairs 

(C) STRANDEDNESS: double 

(D) TOPOLOGY : linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 237: 

AAAAGATTAC ATTGCAACAA TTGAAAATTA TCCAAAGGAA GGCATTACCT TCCGTGATAT 6 0 

TAGTCCTTTG ATGGCTGATG GAAATGCTTA TAGCTACGCT GTTCGTGAAA TCGTTCAGTA 12 0 

TGCTACTGAC AAGAAAGTCG ACATGATCGT GGGACCTGAA GCTCGTGGAT TTATCGTGGG 180 

TTGTCCAGTT GCCTTTGAGT TGGGAATTGG TTTTGCGCCT GTTCGTAAGC CAGGTAAATT 240 

GCCACGCGAA GTTATTTCTG CTGACTATGA AAAAGAGTAC GGTGTCGATA CCTTGACTAT 3 00 

GCACGCGGAT GCCATTAAGC CAGGTCAACG TGTTCTTATT GTAGATGACC TTTTGGCGAC 360 

AGGTGGAACT GTTAAGGCAA CTATCGAGAT GATTGAAAAA CTTGGTGGTG TTATGGCAGG 42 0 

TTGTGCCTTC CTTGTTGAAT TGGATGAATT GAACGGCCGT GAAAAAATTG GTGACTACGA 480 

CTACAAAGTT CTTATGCATT ATTAATGAAA ACAGTCCCTA GGGCTGTTTT CTCTACACTA 54 0 

GGATATAAAA ATAGACTATA ACTAGTTAGA GAAAAACTAT AATTGAAAAC TATATCTTCT 600 

TGCAGTATAA TAAAAGGACT AAGTGTTTGA GATTTGTCTT CAAACATATG CAATTATTCC 660 

TGAAAGAGTA CAGTTAGGAG AGGGTTATGC CGATTCGAAT TGATAAAAAA TTGCCAGCTG 720 

TTGAGATTTT ACGGACAGAG AATATCTTTG TCATGGATGA TCAACGTGCT GCCCACCAAG 780 

ATATCCGTCC TTTGAAGATT TTAATTTTAA ATCTCATGCC ACAGAAAATG GTCACAGAGA 84 0 

CCCAGTTGTT GCGCCACTTG GCTAATACAC CCCTACAACT GGATATTGAT TTTCTCTATA 900 

TGGAGAGCCA CCGTTCTAAA ACAACTCGTT CAGAGCACAT GGAGACCTTC T AT AAAAC T T 9 60 

TTCCTGAAGT CAAGGATGAG TATTTTGATG GGATGATCAT CACGGGTGCT CCAGTTGAGC 1020 

ATTTACCATT TGAGGAAGTG GACTATTGGG AGGAATTTAG ACAGATGCTT GAGTGGTCTA 1080 

AGACTCATGT CTATTCGACC CTTCATATCT GTTGGGGGGC TCAGGCTGGG CTTTATCTGC 1140 

GCTATGGTGT AGAAAAATAC CAGATGGACA GTAAGCTATC AGGTATTTAT CCTCAGGACA 1200 

CCCTAAAAGA GGGTCACCTT CTATTTAGAG GCTTTGATGA TAGCTATGTA TCCCCTCATT 12 60 

CACGGCACAC GGAGATTTCT AAGGAAGAGG TCTTAAACAA GACCAATCTC GAGATTTTAT 132 0 

CAGAAGGACC TCAGGTTGGG GTTTCTATTw TGGCCAGTCG TGATTTACGA GAAATTTATA 1380 

GTTTTGGTCA TTTGGAGTAT GACCGTGATA CTTTGGCAAA AGAGTATTTT CGAGATCGTG 1440 

ATGCAGGTT 1449 
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(2) INFORMATION FOR SEQ ID NO: 238: 

(i) SEQUENCE CHARACTERISTICS: 

!A) LENGTH: 904 base pairs 

(B) TYPE : nucleic acid 

(C) STRANDEDNESS: double 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 238: 

TACCCGCTTC TTTCAAGAGT TGGAGCAGGG CTTGTTTGCG ATCTTTTGTC ATAGTTCTTC 60 

CTTTTAACGG CGTTTTCGAA GCACTTTATA GACAGCTAGT GCTAATGTAT AGTCTACCAT 12 0 

ACTATGGATA ATTGTACCAA ATCCAACTAG TACAAATAGA ACATAAAACA TATTTTCTAC 18 0 

ATTGGTACCA GAAGTTGCGT AAAAAACGAC ACAGGCCAAT ACTTCAGCAA GGGCATGAAC 24 0 

AACAGCCAAA ACAAAGTTGA AAATCCAGGA AGATTTTGGT TTATCTAGGG TATCGGGGAA 300 

TTTTTGTAGG TAAAGAGCTC CTAAAGCACC AAAAGATATA TGGGAAAAAG CCCGAAAAAC 3 60 

GATAACCATG GGATAGCCAG CCATCAAAAA TCCAAAACTA GAGGCTAGGA TGACAAAAAC 42 0 

TGCCATCAAG GGCGACAAGA ACATGGCTAT AAAAATAGCG ATGTGGCTCC CCAAAGTATA 480 

GGAAGCAGGT GGAATGACAA TCTTGAAAGG CATAACAATT GGAATCAAAA TCGCAATAGC 540 

CGTTAAAAGG GCTGTCATTG TCATAAATTG TGTCTTTTTC CGTGTATTCA CAAGAATCTC 60 0 

CTTTTTAACT GCATATACAC TAGTATGGTA CAATAAACCA GACAATAAAG CAAGAATTTA 660 

CTTGGGTTTA TAGATCATTT TTTAGTTAAA AGTTATAGTA GATTGAAACT AGAATAGTCC 720 

ACCTCTACTT CTAAAACATT GTTAGAAATC GATTTGGCTG TCCTGATCGA TTTGTCCTGT 780 

TCTTATTTCG TTTTACTATA GTAAAGATTT CATTAAAAAG AAACTGTATA GAGCAAAATC 840 

TCCACCTTCA GGTTTGGAAA GCGGAGATTG TTTnTTATTT TTTCCAGGGT TTGTAGTCGT 900 

GGGA 904 
(2) INFORMATION FOR SEQ ID NO: 239: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 946 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION : SEQ ID NO: 239: 
CACTCAAACA TGACTTATAT CAAGACGGAT GGACTTCAAG ACGATGCCAA TCGCTTGAAT 60 
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CGTAACATTC AGTTTGGTGT TCGTGAATTT GCAATGGGAA CAATCTTGAA CGGGATGGCC 12 0 

CTTCATGGTG GACtTCGTGT ATACGGTGGA ACTTTCTTCG TCTTCTCTGA CTATGTGAAG 180 

GCAGCTGTCC GCTTGTCAGC CTTACAAGGA CTTCCTGTGA CTTATGTCTT TACCCATGAT 24 0 

TCAATCGCAG TTGGGGAAGA TGGTCCGACT CATGAACCAG TTGAGCATTT AGCAGGTCTT 3 00 

CGTGCTATGC CAAAT CT AAA TGTTTTCCGT CCAGCAGATG CGCGTGAAAC GCAAGCAGCT 3 60 

TGGTACCTTG CAGTGACAAG TGAGAAAACA CCAACTGCCC TTGTCTTGAC ACGTCAAAAT 420 

TTGACTGTTG AAGATGGAAC AGACTTCGAC AAGGTTGCTA AAGGTGCTTA TGTTGTATAT 480 

GAAAATGCAG CCGACTTTGA TACCATCTTG ATTGCGACAG GTTCAGAGGT TAATCTTGCT 540 

GTCTCAGCTG CTAAAGAATT GGCTAGTCAA GGCGAAAAAA TCCGCGTAGT CAGCATGCCA 600 

TCTACAGATG TCTTTGATAA ACAAGATGCA GCTTACAAGG AAGAAATTCT TCCAAATGCA 660 

GTCCGCCGTC GTGTTGCAGT CGAAATGGGT GCAAGTCAAA ACTGGTACAA ATATGTTGGT 72 0 

CTCGATGGTG CCGTTCTAGG TATTGATACT TCGGAGCCTC TGCCCCAGCA CCAAAAGTAT 7 80 

TGGCAGAATA TGGCTTTACT GTAGAAAATC TTGTAAAAGT TGTTCGAAAC TTGAAATAAT 84 0 

CCTAAAAATC AGGGCGTAAG CTCTGGTTTT TCTTACCAGA AAAGTAAGGT ACAATCTTGT 900 

AAAAGTAGCT GAAATTTGAT ATAGTAGTCC TATGTAAAAG ACAAAG 94 6 
(2) INFORMATION FOR SEQ ID NO: 240: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH : 2764 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : douole 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID MO: 240: 

CGGGGCTCCC TAGTTCTTAG GGAGCTATTT TTGTTTTTTC AAGAAGTTAT CTTCTTGTAT 60 

TTTATACTCA ATGAAAATCA AAGAGCAAGC TAGGAAACTA GCCGTAssTG CTCAAAAC AC 120 

TGTTTTGAGG TTGTAGATAA GACTGACAAA GTCAGGAACA CATATCTACG GCAAGGCGAC 180 

GTTGACGCGG TTTGAAGAGA TTTTCGAAGA GTATTAGTTG TGAATCTGGT GCAGTCGTCC 24 0 

CAGATTATTC TTATTAGTAG GGTCTTGTTT TCTATATCCC CTCGTAGTTA ACAAGACCTT 300 

GAGCATTTTA GAAAGAGGAA TCTATGTCTA CGAAATATAT TTTTGTAACT GGTGGTGTGG 360 

TATCGTCCAT TGGGAAAGGG ATTGTGGCAG CGAGTCTAGG CCGTCTCTTG AAAAATCGTG 420 

GTCTCAAAGT AACCATTCAA AAGTTTGACC CTTATATCAA TATTGATCCG GGAACCATGA 48 0 

GTCCTTACCA GCACGGGGAA GTTTTTGTGA CAGATGACGG AGCTGAGACA GATTTGGACT 540 



WO 98/18931 



PCT/US97/19588 



1261 

TGGGTCACTA TGAACGTTTC ATCGATATCA ATCTCAACAA ATATTCCAAC GTGACAACTG 600 
GGAAAATTTA CAGTGAAGTT CTTCGTAAAG AACGCCGTGG AGAATACCTT GGGGCAACTG 560 
TTCAAGTCAT TCCTCATATC ACAGATGCTT TGAAAGAAAA AATCAAGCGT GCCGCTCTAA 72 0 

CGACCGACTC TGATGTCATT ATCACAGAGG TTGGTGGAAC AGTAGGAGAT ATCGAGTCCT 780 
TGCCATTCCT AGAGGCTCTT CGTCAGATGA AGGCAGATGT GGGTGCGGAT AATGTCATGT 840 
ATATCCATAC AACCTTGCTT CCTTACCTCA AGGCTGCTGG TGAAATGAAA ACCAAACCAA 900 
CCCAACACTC TGTCAAAGAA TTGCGTGGCT TGGGAATCCA ACCAAATATG TTGGTTATTC 9 60 

GTACAGAAGA GCCAGCTGGT CAAGGAATTA AAAATAAACT GGCCCAGTTC TGTGATGTGG 102 0 

CACCAGAAGC CGTTATCGAA TCGTTGGATG TTGAACACCT TTACCAAATT CCACTGAACT 1080 

TGCAGGCACA AGGGATGGAC CAAATTGTTT GTGATCATTT GAAATTAGAC GCACCAGCAG 1140 

CGGATATGAC AGAATGGTCA GCCATGGTGG ACAAGGTCAT GAACCTCAAG AAACAAGTTA 12 00 

AGATTTCCCT TGTTGGTAAG TATGTGGAGT TGCAAGATGC CTATATCTCA GTGGTCGAAG 12 60 

CCTTGAAACA CTCTGGCTAT GTCAATGATG CAGAAGTTAA AATCAATTGG GTCAATGCCA 132 0 

ATGATGTGAC AGCAGAGAAT GTAGCAGAAC TCTTGTCTGA TGCGGACGGG ATCATCGTAC 13 80 

CAGGTGGTTT TGGTCAACGT GGTACAGAAG GGAAAATCCA AGCCATCCGC TATGCGCGTG 144 0 

AAAATGATGT TCCAATGTTG GGAGTCTGCT TGGGAATGCA GTTGACATGT ATCGAGTTTG 1500 

CTCGTCACGT TTTAGGTCTT GAAGGTGCCA ATTCTGCAGA GCTTGCACCA GAAACAAAAT 1560 

ACCCTATCAT TGATATCATG CGTGATCAGA TTGATATTGA GGATATGGGT GGAACCcTTC 162 0 

GTTTGGGACT TTATCCGTCT AAGTTGAAAC GTGGCTCTAA GGCTGCTGCT GCTTATCACA 1680 

ATCAAGAAGT GGTGCAACGC CGTCACCGTC ACCGTTATGA GTTTAATAAT GCCTTCCGTG 174 0 

AGCAGTTTGA GGCAGCAGGT TTTGTCTTTT CAGGAGTTTC TCCAGACAAT CGTTTGGTAG 1800 

AAATCGTGGA AATTCCTGAA AATAAATTCT TTGTAGCTTG TCAGTATCAC CCTGAACTGT 18 60 

CAAGCCGTCC AAACCGACCA GAAGAACTCT ACACTGCCTT TGTTACTGCA GCAGTTGAGA 192 0 

ACAGCAATTA GCAAAATCAG AACCTTTGAG AAAAATCTCA GAGGTTTTTT GCATACGATG 19 80 

ATATTGCAGT ATATCTGAGG TAGGGGTCCT CTGTATGTAC CTGCTACCGT TGAAATCAAT 204 0 

AGCGACTCCC TCTTGCCCTG TGCTAGTGAA TGGATTTATC AGTATATTGA AATGAAATAA 2100 

AATTTGAACA AATTAATTCG GAAAGCCAAA TCAATTTCTA GCAAAGTTTT AGGAACTGGA 216 0 

TTGTATAGTG AATTGAAATA AGATGTGAAC ATCTCTATCA GGAAAGTCAA ATTAATTTAT 222 0 

AGAAATATTT TAGCAGTCAA GATGTACTGT TATAGATTCA ATACATTATA CTTTTTTAAT 2280 
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TTAATCCACT ATAGTAAAAT GAAATAATAA CAGGACAAAT CGATCAGGAC AGTCAAATCG 2340 

ATTTCTAACA ATGTTTTAGA AATAGAGGTG TACTATTCTA GTTTCAATAT ACTATCCCAA 24 00 

ATCATTCATA CCTCTCTCAA CTAGATGTAA CTTACAAAAC CCCTGACCTC ATGAGCCACT 2 4 60 

TTCTTCCTCC TCATGAGGTC AGTTTTACTT TCTGCTGTTC CAGTATCGTT TTTCCTCGCT 252 0 

AGATTTCCTC AAAAGGGCAG ACTCCTCCCT TGGTGCGTCA CACGATTTTT TCATCTCGAC 2580 

TGTTCTTTAA TGCATCATTA ACGACGCTTT TCTTCTAGGT GGTTCATAAG GAACAGGAAG 2 64 0 

ATTCAGGTTG ACTTTTCTAA TCCTAGAATA AAGTGCTGAA AACAATTCGG AATAGGCATA 27 00 

GAGACTAGAC AATTTGAGGA GCTGCTTGCG TCCTGTTCGA ACACATTTTC CCACCACGTG 27 60 

AAGA 2764 
(2) INFORMATION FOR SEQ ID NO: 241: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1682 base pairs 

(B) TYPE : nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY : linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 241: 

CCGTTTTTTT CATTGTTCAG TACTACAACT TACGTTGTAG CGCCCTGCAC ATTGGTTCGT 60 

CTTGTTCAGT TTTCAAAGGT CTTTGTCACT TGCTTCTCTC AAGCGACAAC TATATTAGTA 120 

TATCACAACT GCTTTCGCTT GTCAACACTT TTTTGAAGAT TTTTAAGTTT TTTTAAACTT 180 

TTTTTCATCA AGTGGTCCTG ACGCAACATA CCATAGTCCG TACGGGATTC GAACCCGTGT 24 0 

TACCGCCGTG AAAAGGCGGT GTCTTAACCC CTTGACCAAC GGACCTGAGT TGTTATTTTC 300 

AACTCTTACT ATTATACAGT CTTTTCAAAC TTTGTCAACT ACTTTTTTAA ACTTTTTTTA 360 

TTAATTTTAC AACAGCTTCA GTTCGAGCTG TATGTGGGAA CATATCGACC GACTGGATAT 42 0 

AATGAAGATC ATAGACTTCT ACTAAGCGTA CCAAATCACG AGCCAAGGTC GAAACATTAC 480 

AAGAAATATA AACCATTTTT TCTGGTACAT AAGTAAGAAT AGTATCTAAT AACTTATCAT 54 0 

CCAGACCTGT ACGTGGTGGG TCAACAATCA AAGCATCTGC TCGGTAGCCT TCCTTGTACC 600 

AACGAGGAAT AATCTCTTCT GCCGTTCCAG CTTCATAATG AGTATTGTCA AATCCCATTC 660 

TTTTAGCATT TCGCTTGGCA TCTTCAATAG CTTCTGGAAT AATATCCATA CCTCTGAGTG 72 0 

TTTTTACTTT CTTTGCAAAG GCAAATCCAA TCGTTCCAAC TCCACAATAA GCGTCAATCA 78 0 

AATGGTCTTC TTTATCAACA TCCAGCGCTT TTACTGCTTC GCTATAGAGG ACTTCTGTTT 840 

GCTCAGGATT TAGTTGATAA AAAGCTCGAG GGGATAGTGA AAATTCATAA TTGAGTACAC 900 
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CTTCTTGAAT ACTCTCTTGC CCCCAGATAA TCTCTGTCTT TTCACCATAT ATCTCACTGG 9 60 

TTTTAGCTGT ATTTGTATTA ACAGCTACTG TCACAACTTC TGGGAAATCT TTAACCAACT 1020 

CTTTTACCAA TTGAGTTAAA TTAAGCTGGC GGTTTGTAAC AATAATAATC TGAACCTGTC 1080 

CGGTCTTTCT CGCGCGTCGG ACCATAATAG TACGGACACC TAGAACTTTT CTCTCATCCG 114 0 

TGATTGGAAT CTGGTGATAA GTAAGTAATT CTGCTAAGCG ATTAGCAATC ACTTGGGTTT 12 00 

CCTTATCTTG TACCAGGCAG TCTTTCAACT CTACTAAATA GTGAGAGTTT TGTGCATATA 12 6 0 

AGCCCGCCTT GACCTGATTT TTAAATTTTC GAGTCTGAAA TTGTAACTTA GCTCTGTAAT 13 2 0 

ATTTTGGTTC CTGCATTCCA ATAGTTGGAC GAATTTCATA ATTTTCATAT CCTGCAGGAG 13 8 0 

CAAATTTTTT CAGCGCTTGA TGAAGTAAGT CCGTCTTGAA CTCCAGCTGC TTATCATAAT 1440 

GCAGGTGCAT GATTTGGCAG CCTCCGCATT CATTATAAAT AGTACAAGAT GGCACAATTC 1500 

GAAATTTAGA CTTCTTGTTG ACCTTCAGTA ATTTTGCTTC AACAAAGTTG CGTCTAATAG 15 60 

AAGTAATCTG ACAATAGATA TCTTCGCCTT TGAGAGCTCC TGGTACAAAG ACTAATGTTT 162 0 

TTTGGTAAAA GCCGATTCCC TCACCGTTAA TTCCCATGCG CTTGATTTTT AATGGTATTT 1680 

TT 1682 
(2) INFORMATION FOR SEQ ID NO: 242: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2524 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY : linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 242: 

TTAACTTTGG TCAATTCTTT AAAGTCATCC TCTGTAAGCA TGTCTAACCA TTGATGTTTC 6 0 

CCTTTATTGC TAAAATCACC AATTCCGACT ACAGCTATAT CTAAATCTTT CCAACTATTT 12 0 

TTCAAATTTT CAAAATATCT TGATTGCAAA ATACCATCTG CTAACAATTT ATTTTCTTGC 180 

ACAATCGTTG CATTCATAAA TGTACACTCT CCATGAAATT TTCTAGACAT TTCATAAATC 240 

AGTGTATTCA CATGGTATTT AGCGTGTATG TGACTAGGAC CACCTGCTAG AGGATAGAAG 300 

TGAACATTTC GGACACTTTT ACTGTGAATT AAATCTACTA AATTACTTAA ACTTTTCCCC 3 60 

CAAGAAAAGC CAATTTTCAT ATTATCATCA ATTAGATTCC TAAGGACGCC TGCTGCAACT 42 0 

TGAGAAATTC TTTCAGATAA AATTGTTGGA GTATCATCAA ATTCATTTGG AATAATTTCT 4 80 

AAACTTTCCA AACTGTATTT TTCTTTTACA TAATTTTCCA ACTTAAACAT ATTGGTATCA 54 0 
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ATAGAGGTTC TATAAATTCC TAATTTTGCT GCTATTTGTG ACTGATTTAA GTTTTCAATA 660 

TAATACAGAT AAGCAATTTT AGAAAGCAGT TTATTCCTAT CTTGATTCAT ACACTTAACC 72 0 

TCTTACGAAA CTACCTTAAC CATTATCCCA GCATTTTCTA ATGTAGCTAT ATTTTGTTTA 78 0 

GAAAGTTTTT CGTCTGTTAT TACTTCATAG ACTTGACTTA AAGCAAATCT TCTTACTGTA 840 

CCTCTTTTAT CAAATTTACT TGAGTCAGTT AGGACAATGA CTTTATCCGA CACTGCTGAA 9 00 

ATATATTGAA CTACCTCACT GCGCATTAAA TCTTTTCCGG TAAAGCCCAT CTCTTTATCG 960 

TAACCATCTG TCCCAACAAA AGCTTGACAC ACATGAAAAG TCTGTATCAT TTCTTTTAAT 1020 

AAAGGTCCTA CAGTCACCTG TGAATCTTTC TGAAACTCAC CACCAAGAAC AATAACACGA 10 8 0 

CATGAATCAT AAGCTCTCAC AAAATTTGCT ATAAAAAACG AATTTGTTAC AATCGTAACA 114 0 

TTTCTTTTTT GCTTGCAAAT TTCCTCAGCA AGTAAAGCAC AGGTCGATCC AGATTCTATC 12 00 

ATTATTGTTT CATTATCTGA CACCAATTTT ACTGCTTCCT GAACAATTTT TCTCTTAGTT 12 60 

TCATAATTAA TTGACAAACG TACATTTAAG TCATCTCCAC TATTTAATAC AGCATATCCA 13 2 0 

TGCTCTCTGT GTAATAAACC TTTTGACTCT AATTTATCTA AATCTTTTCT AATCGTTACT 13 80 

TTCGATACAT TTAATTTTTC CGATAATGTA TTAACGTCGA TCTTTTCATA TTCTGATACT 1440 

AATTTAATAA TTTGTTCCAA TCTTTTCATT TTACACCTCC GTTTTATTCT ACCAAAATAA 1500 

AAAGCAAAAA ACAACAAATT AACCTTTCGT TCGTAATTGT TTTTCTTTCG TTTTTGTGAT 1560 

AGGATAGACT TATGAAGAGG AGGAACTCTT ATGGAAATAT CTAAAGGAAT TATTTTTAAT 1620 

ATTCAACACT TTTCAATTCA TGACGGTCCG GGTATTCGTA CAACTGTTTT TTTAAAAGGA 1680 

TGTCCTCTGC GCTGTCCATG GTGTTCTAAT CCTGAATCTC AAAGAATGAA ACCTGAAAAA 1740 

ATGAAAGATG CTCAACGAGA GAAATTCACC TTAGTCGGTG AAGAAAAGAC TGTAGAAGAA 18 00 

ATTATTACAG AGGTATTAAA AGACAAAGAA TTTTACGAAG AATCCGGTGG AGGTTTAACT 18 60 

TTATCAGGAG GTGAAATATT TGCTCAGTTT GAATTTGCTA AAGCCATCTT AAAATCAGCT 192 0 

AAAGAACATC ACATACACAC TGCCATTGAA ACTACTGCCT TTGTTGATCA TGAAAAATTT 19 8 0 

ATTGATTTAA TTCAATATGT GGATTTTATC TACACAGACC TAAAACATTA TAATTCTATA 2040 

AAACATAAAA AAGTGACTGG GGTTTTTAAT CAAATGATTA TTAAAAACAT TCATTATGCT 2100 

TTTTCACAAA ATAAAACTAT CGTTTTAAGA ATCCCAGTTA TTCCTAATTT TAACAATAGT 2160 

TTAGAGGATG CAGAAAAATT CGCTACTCTA TTTAACTCAT TAAATATCGA CCAAGTTCAA 222 0 

CTACTCCCTT TTCATCAATT TGGTGAAAAC AAATATCGTT TATTAAATCG GAAATATGAA 22 80 

ATGGATGGAA TCAACGCACT TCATCCwGAA GATCTTATTG ATT AT C AAAA GGTATTTCTG 2340 
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AACCACCATA TTAATTGTTA TTTCTAGTTT ATTTCCTTGA AATCCTCTAG CTATTTGCAG 2400 

ATAACAAGCA TCTATAATAC ATACTTAACT TTTCAAAAGG TTTAGCTAAA AAATTTTAGC 2460 

CAAACCTTTT CTATTTTACC TTGCTCTAGA ATTTTTAAAC TGCTATACTT AT C AC AAAAA 252 0 

AACG 2524 
(2) INFORMATION FOR SEQ ID NO: 243: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 2359 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 243: 

CGTGCTTGGG GGCTTGTGGT CAAAAGGAAA GTCAGACAGG AAAGGGGATG AAAATTGTGA 60 

CCAGTTTTTA TCCTATCTAC GCTATGGTTA AGGAAGTATC TGGTGACTTG AATGATGTTC 12 0 

GGATGATTCA GTCAAGTAGT GGTATTCACT CCTTTGAACC TTCGGCAAAT GATATCGCAG 180 

CCATCTATGA TGCAGATGTC TTTGTTTACC ATTCTCATAC ACTCGAATCT TGGGCAGGAA 24 0 

GTCTGGATCC AAATCTAAAA AAATCCAAAG TGAAGGTCTT AGAGGCTTCT GAGGGAATGA 3 00 

CCTTGGAACG TGTCCCTGGA CTAGAGGATG TGGAAGCAGG GGATGGAGTT GATGAAAAAA 3 60 

CGCTCTATGA CCCTCACACA TGGCTAGATC CTGAAAAAGC TGGAGAAGAA GCCCAAATTA 42 0 

TCGCTGATAA ACTTTCAGAG GTGGATAGTG AGCATAAAGA GACTTATCAA AAAAATGCGC 480 

AAGCCTTTAT CAAAAAAGCT CAGGAATTGA CTAAGAAATT CCAACCAAAA TTTGAAAAAG 540 

CGACTCAGAA AACATTTGTA ACACAACATA CAGCCTTTTC TTATCTAGCG AAGAGATTTG 600 

GGCTTAATCA ACTTGGTATT GCAGGTATCT CTCCTGAACA AGAACCAACT CCACGACAAC 660 

TAACAGAAAT TCAGGAATTT GTTAAGACCT ATAAGGTTAA AACGATTTTT ACAGAAAGTA 72 0 

ACGCTTCTTC AAAAGTAGCT GAAACTCTTG TCAAATCAAC AGGTGTGGGT CTTAAAACTC 7 80 

TGAATCCTTT AGAGTCAGAC CCACAAAATG ACAAGACCTA TTTAGAAAAT CTTGAAGAAA 840 

ATATGAGTAT TCTAGCAGAA GAATTAAAGT GAGGAAAGAA TGAAAATTAA TAAAAAATAT 9 00 

CTAGCAGGTT CAGTGGCAGT CCTTGCCCTA AGTGTTTGTT CCTATGAGCT TGGACGTTAC 960 

CAAGCTGGTC AGGATAAGAA AGAGTCTAAT CGAGTTGCTT ATATAGATGG TGATCAGGCT 102 0 

GGTCAAAAGG CAGAAAACTT GACACCAGAT GAAGTCAGTA AGAGGGAGGG GATCAACGCC 108 0 

GAACAAATTG TTATCAAGAT TACGGATCAA GGTTATGTGA CCTCTCATGG AGACCATTAT 1140 
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CATTACTATA ATGGCAAGGT TCCTTATGAT GCCATCATCA GTGAAGAGCT CCTCATGAAA 1200 

GATCCGAATT ATCAGTTGAA GGATTCAGAC ATTGTCAATG AAATCAAGGG TGGTTATGTC 12 60 

ATTAAGGTAA ACGGTAAATA CTATGTTTAC CTTAAGGATG CAGCTCATGC GGATAATATT 13 2 0 

CGGACAAAAG AAGAGATTAA ACGTCAGAAG CAGGAACGCA GTCATAATCA TAACTCAAGA 13 80 

GCAGATAATG CTGTTGCTGC AGCCAGAGCC CAAGGACGTT ATACAACGGA TGATGGGTAT 1440 

ATCTTCAATG CATCTGATAT CATTGAGGAC ACGGGTGATG CTTATATCGT TCCTCACGGC 150 0 

GACCATTACC ATTACATTCC TAAGAATGAG TTATCAGCTA GCGAGTTAGC TGCTGCAGAA 1560 

GCCTATTGGA ATGGGAAGCA GGGATCTCGT CCTTCTTCAA GTTCTAGTTA TAATGCAAAT 162 0 

CCAGCTCAAC CAAGATTGTC AGAGAACCAC AATCTGACTG TCACTCCAAC TTATCATCAA 1680 

AATCAAGGGG AAAACATTTC AAGCCTTTTA CGTGAATTGT ATGCTAAACC CTTATCAGAA 1740 

CGCCATGTGG AATCTGATGG CCTTATTTTC GACCCAGCGC AAATCACAAG TCGAACCGCC 1800 

AGAGGTGTAG CTGTCCCTCA TGGTAACCAT TACCACTTTA TCCCTTATGA ACAAATGTCT 1860 

GAATTGGAAA AACGAATTGC TCGTATTATT CCCCTTCGTT ATCGTTCAAA CCATTGGGTA 192 0 

CCAGATTCAA GACCAGAAGA ACCAAGTCCA CAACCGACTC CAGAACCTAG TCCAAGTCCG 1980 

CAACCAGCTC CAAGCAATCC AATTGATGAG AAATTGGTCA AAGAAGCTGT TCGAAAAGTA 2 040 

GGCGATGGTT ATGTCTTTGA GGAGAATGGA GTTTCTCGTT ATATCCCAGC CAAGGATCTT 2100 

TCAGCAGAAA CAGCAGCAGG CATTGATAGC AAACTGGCCA AGCAGGAAAG TTTATCTCAT 2160 

AAGCTAGGAA CTAAGAAAAC TGACCTCCCA TCTAGTGATC GAGAATTTTA CAATAAGGCT 2220 

TATGACTTAC TAGCAAGAAT TCACCAAGAT TTACTTGATA ATAAAGGTCG ACAAGTTGAT 2 2 80 

TTTGAGGCTT TGGATAACCT GTTGGAACGA CTCAAGGATG TCTCAAGTGA TAAAGTCAAG 2340 

TTAGTGGAAG ATATTCTTG 2 3 59 
(2) INFORMATION FOR SEQ ID NO: 2 44: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1052 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 244: 

TTCTTTCTGC TATAATCGTA TAAAATACTT ACTTTAGGAG TTCTTATGAA AGTTGTTAAA 60 

TTTGGAGGTA GTTCTCTTGC CTCTGCTAGT CAATTAGAAA AAGTTTTAAA CATCGTCAAA 12 0 

AGCGATTCAG AGCGTCGTTT TGTAGTCGTT TCTGCGCCTG GTAAACGCAA TGCTGAAGAT 180 
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ACTAAGGTTA CGGATGCCCT GATTAAATAC TACCGCGACT ATGTTGCGGG TAACGATATT 240 

AGCAAGAACC AAAGCTGGAT TATCGACCGC TATGCTGCTA TGGTTAGTGA ATTGGGACTA 300 

AAACCAGCTG TGCTAGAAAA AATTTCTAAA AGCATTCACG CCTTGGCCAC TCTTCCTATT 3 60 

GAAGAAAATG AATTTCTCTA CGATACTTTC CTAGCAGCCG GTGAAAATAA CAATGCCAAA 42 0 

TTGATTGCTG CCTACTTTAA CCAAAATGGT ATCGATGCAC GCTATATGCA CCCTAGAGAA 480 

GCTGGGATTG TGGTCACAAG TGAACCTGGT CACGCTCGCA TCATTCCATC AAGTTATGAC 54 0 

AAGATTGAAG AATTGACAAA CACCAATGAA GTCCTTGTCA TTCCTGGTTT CTTTGGTGTC 600 

ACTAAGGAAA ATCAAATCTG TACTTTCTCA CGTGGAGGTT CTGATATTAC AGGTTCTATC 660 

ATTGCTGCTG GTGTCAAAGC TGACCTCTAT GAAAACTTTA CGGACGTTGA TGGTATCTTT 72 0 

GCAGCCCACC CTGGTATTAT CCACCAACCA CACTCGATTC CTGAGTTGAC CTACCGTGAA 780 

ATGCGCGAGT TGGCCTATGC AGGCTTCTCA GTCCTTCATG ACGAGGCTCT TCTTCCTGCC 84 0 

TACCGTGGAA AAATTCCTCT GGTTATCAAG AATACCAACA ACCCTGACCA TCCAGGTACT 90 0 

CGTATCGTTC TAAAACACAG TAATGATGAA TTTCCACTTG TGGGAATTGC TGGTGACTCA 9 60 

GGCTTTGTCA GCATTAACAT GTCGAAATAC CTCATGAACC GTGAGGTTGG ATTTGGCCGC 102 0 

AAGGTTCTGC AAATCCTGGA AGAACTTAAC AT 1052 
(2) INFORMATION FOR SEQ ID NO: 245: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 855 base pairs 

(B) TYPE : nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 245: 

CCCTCGAAAA CTAAGCCGAT GAAGTCAGAA CACTTCAATC CTGTTCGTGA CTGGTGGGAA 60 

AATCGTGAAG AGATTCTGGA AGGTAAGTTC TACAAATCTA AATCATTTAC ACCTAGTGAA 12 0 

TTGGCTGAGT TGAATTATAA TTTAGACCAG TGTGACTTTC CAAAAGAGGA AGAGGAAATC 180 

TTAAATCCCT TTGAGTTGAT TCAGAATTAT CAAGCGGAAA GAGCAACTTT AAATCATAAG 240 

ATTGATAATG TATTAGCTGA TATTTTGCAG TTGTTGGAGG ACAAATAATG ACACCAGAAC 300 

AACTTAAAGC AAGTATTCTC CAAAGAGCGA TGGAAGGGAA ATTAGTGCCG CAAAATCCCA 360 

ATGACGAACC TGCAAGTGAA TTATTAAAGA GAATTAAAGC TGAAAAAGAA AAACTTATCA 420 

GTGAAGGAAA AATCAAACGA GATAAAAAGG AAACTGAGAT ATTTCGTGGT GATGATGGGA 4 80 
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AACATTATGG GAAGTTTGCT GATGGAAGCA CTCAAGAAAT TGATGTTCCT TATGATATTC 54 0 

CTGATACTTG GGAGTGGGTG AGGATAAAAT CAATTTATTG GAATTTTGGG CAAAATAAGC 600 

CAGAGAAATC CTTTAGGTAT ATAGATACGT CTAGTATTGA TACAAAAAAG AACATAATCA 660 

ACTACAAAAA TCTACAATAT CTTTCACCTG AACAAGCGCC TTCCCGTGCT AGAAAATTAG 72 0 

TTTCGCAGAA TAGTGTCTTA TTTTCAACAG TTAGACCATA TCTAAAAAAT ATTGCTGTAG 780 

TTAGAGAACT TAAAGAGTAT TTGATAGCTA GTACAGCATT TAATGTTTTG GGATACTTTA 840 

CTTAACGAAA CAT AT 855 
(2) INFORMATION FOR SEQ ID NO: 246: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 660 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

{xi) SEQUENCE DESCRIPTION: SEQ ID NO: 246: 
TTTAGGAAGG CTATCCGTAA TTTTACAAAG GATTTAGATA TTACAGAGGA ACATTTAGAT 60 

ATTATCAAAA GAGAGATGTT TGGCGAATTT TTCAGTAGCA TGAACTCTCT TGAATTTATT 120 

GCAACGCAAT ATGATGCTTT TGAAAATGGT GAGATAATTT TTGATTTGCC GAAAATTTTA 180 

CAGGAAATTA CTTTAGAGGA TGTCCTTGAT GCTGGACATC ATTTAATAGA TGATGGTGAC 240 

ATAGTTGATT TTACAATATT CCCATCGTAG TAACCTATTA TAATAGACAC TAGAAAGAAG 3 00 

GGATGACAAG TATGAGAAAA AAAACAATTG GAGAGGTTTT ACGATTAGCT AGAATCAATC 3 50 

AGGGATTGAG TTTAGATGAA TTGCAGAAAA AGACAGAAAT CCAGTTAGAT ATGTTGGAAG 420 

CAATGGAAGC AGACGATTTC GATCAACTTC CAAGTCCTTT TTACACGCGT TCTTTCTTGA 4 80 

AAAAATATGC ATGGGCTGTT GAGTTAGATG ACCAAATTGT TTTGGATGCT TATGATTCTG 54 0 

GGAGTATGAT TACTTATGAG GAAGTAGATG TTGATGAAGA TGAGTTGACA GGTCGTAGAO 600 

GTTCAAGTAA GAAAAAGAAG AAAAAAACAT CATTTTTACC TTTATTTTAT TTTATCCTGG 6 60 
(2) INFORMATION FOR SEQ ID NO: 247: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1805 base pairs 

(B) TYPE : nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 247: 
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CCGGTTGCAC AGGATCGTGC ATAGTCAACT CTTCAAGTAT AGCATATCTC CTATTTTCTT 60 

ACAAGTAATA ACACCTAAAA TGAAGCTTTT TCTTTTACTT TTTTCTGCCA AGAGGCAAAA 120 

AGCATGCTGA GGTAAAAAAC GCTCATCATA ATAGGAACAC CAAGAATGGT CTTTTCATGA 180 

TAGAAAATCG TCAAATAGGC TGAAAAGACA ACGCCAAGGA CAAAACTACT AAGCAGGCTA 240 

ACAAATATGA ATCCTTCACG CAAAAAAGGA GTGTGCTTGG TTCGGAAATA ATCTCCAAAA 3 00 

GCCAGCATGG TCCGTTTGAT ATTCCCTGTC ATAAAAGCGT TATTATAGGC AATACCCGAC 3 60 

ACTTCTCCAA AAGCAGTTGT CACCAGTCCC ATACAGAAGG CCAAGGGCGG CACTAGATAG 420 

ATATTATCCA CAGTTTGCGG CACAAAAGCA ATAATGATTG ATAAGATTGC CAAGGGAATC 480 

AAGGACAGAA TAGGTTTTTT CACAATTCTC AATTTTTCCT TATAAATCGT TAATAAAAAG 540 

ACTCCCATCA TAAACGCTAG CAAGGTGAGA ACCTTGTCCC TAACATCCGA AACATTATTT 600 

TTAATTAATT CTACTGAAAG AAAGACAACA TTTCCAGTTT GTCCAGCTAC AAGGGTATTC 660 

CCGCGAACAA TAAAAGTGTA AGCATCCACA TATCCAGCAC AAAACGTCAA AAAAAGTGCT 72 0 

AACCTTTTAG ACTGACGTGA TATTTTTCTT ATAGGTAATA ACCTCATTTT ACCTCCCATT 780 

GTATTTTCTC TTAGAAATAT TGTACCATTT TCTTTCTAAA AAATCGTAGG CTACCATTTA 840 

GATTTTACTA TTAGCATAAA AATAATAATA GACAACTATT TATCCAAAAA TAGATAGATG 900 

TAACATGTTT GCAAACAAAG CATACGAACC TTTAGTAAAA TCATTTCCAT GAAACTAGAA 960 

TAGAGCCCTC TTAGCAAAAA TCATTATTTT AATTTATTTC TAATCACTCC TTGACATAAA 1020 

TAACTCTCAC CAATAAAAGA CTATGTCTTA AAAAAATGGT ATAATAAAAT CAATACTTGG 1080 

GCTTGATGGC TATGCTACTA ATAACAATTA GGAGAGAAAA TCAGGCACTT GTTAACAACA 1140 

AGGATTATCC CCTTGAGATG AAAGGAACTT TAGAAATCTT ATGATGAACA TGCAAAACAT 1200 

GATGCGTCAA GCACAAAAAC TTCAAAAACA AATGGAACAA AGCCAAGCTG AACTTGCTGC 12 60 

TATGCAATTT GTTGGCAAAT CTGCTCAAGA TCTTGTCCAA GCGACCTTAA CTGGCGATAA 1320 

GAAAGTTGTC AGCATTGATT TCAATCCAGC TGTCGTTGAC CCAGAGGACC TTGAGACTCT 1380 

TTCTGATATG ACCGTTCAAG CCATCAACTC TGCTCTTGAA CAAATCGATG AAACTACCAA 1440 

GAAAAAACTG GGTGCTTTCG CTGGGAAATT ACCTTTCTAA AAACAAGGAG CTAGAACAAT 1500 

GCTTGTCGAT AACAAAGGCT AAGAAAGGTG CAAAAATGAC TCTATAATAT TTGTAGTGGG 1560 

TAAATCCCCT ATGGATATTA TGGAGCCTAT TTTTGTGTAG AAAAAAGTCC CATATGACCT 1620 

ATAATGAAAA GCGACAAAAC AACTCATTAG AAAGAATCAT ATGGAACAAT TACATTTTAT 1680 

CACAAAATTA CTAGACATTA AAGACCCTAA TATCCAGATT TTAGACATCG TCAATAAGGA 1740 
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TACACACAAG GwAATCATCG CCAAACTGGr CTATGAAGCT CCATCTTGTC CTGAGTGCGG 1800 
AAGTC 1805 
(2) INFORMATION FOR SEQ ID NO: 248: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 2516 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY : linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 248: 

CTGCATCTAG TTTGTTTCTC CCTACAGTTT TAGCTAGACA GATTGGAGAT TATGATTTAA 60 

CGTCGCCGCG TTGGGGTTCG GATACAACTA GTGAGCTTGA GAAAGAAAAC TCCTCTGCTG 120 

GAATTAATAA TAATGACAGC ACTGGTGGCG GTAAAAGGTT AAATACCTCT ATTCGTAGCG 180 

CCTATAGTGG GTCAGATATT ACCCCGGTAT ATTCATTGGG GTCTGGCTCT AGGATTGTCA 240 

TGTACTATAA TGGAGGTGGT GACAATTATA TTGGTTCTGG TACTAGATTA GCTATGGCGC 300 

CACAATTTGG AAATCATGTA AGAATTCATA CTTCAGGTTC TTGGAATCCA GATTCTTATT 3 60 

AACTTACTTG TCAGAGTAAG CCTTAAAGAT GGTTGATTGT GGGTGTAGCA TGAAAAAAGA 420 

ATGCTACACC CTATTTTTAT TATAAGGAGG AGTAAGGATG GAATTTTTCA TTTGTAATCT 480 

TGTACGAGTC GTTCAATCAC CTCGATTTTA TATGTCTTTA TTTTTGACCC TTCTTTGCAT 540 

GAGTTTAGGA AATTTCCTTG CTTTCAATGG TATTTATAAA ATTGAAGGTT TATCGATTTT 600 

TTTTGCCGCT TCTTCTATTC GAGGATTTTC ACCGATTAGC CTAGTAGCTG CACTTATCTG 660 

TACACTGCCC TATTCTAGTC AGATAATAGA GGATGCTGAG AGTCATTTTC TAACAGCACA 720 

ATTGTGTCGA ATTTCTAAAA AGAAGTATCT GGCTATTGTG GGTAGTACTG TAATTATTTC 780 

TTCTTTTCTA GTCTTTTTTC TCCCCTATTT ATTATTATTA GGAATTAATC TTTTAGTGAC 840 

TCCTTATCAG GAAATTTATA TTGGAGATTA TAGTGGTGCC TTAAAAGAAT TATTTGATTC 900 

CAATCAGTTT CTCTATAGTC TTGTAACGAC TCTCTGGTAT GGAGTTTGGG GCGCTGTGTT 960 

CTCTATTTTT GGACTAGCTA GTGCTTTGCT AGTGAAGAAA AAAATAGGAG CTATTTTCAT 102 0 

CCCAGTTGCC TATATGATGG TTGGTGGTAT TTTTTGGGCT ATTTTAGGGC TATCTTACTT 1080 

AGAACCTGTG ACAACGCTAG CTTTGGGATA TCAGAAAGAT ATCAGTCTTT CCTTAGTTAG 1140 

TGCTCATCTT GCTTTTATTT TATTTGTTAG TTGTTTGGTT GTTTATGGTA CATTTTTTCT 12 00 

AC AT TC AG AG GACTATGTAT AATGAAACAA TTTGTTCAAT TTTATAAAAA AGATTTCTTA 12 60 

GCAGTATTGG TTTATTTTAT ATTACTGCTA TCCTGTGTTT TATCTAGTAC AGTATATTTA 132 0 
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TTGCGCtGTC GCCAATATTC AATCCATCCA AATGTATTAG AATGGATCTT AGTTTTACTT 1380 

CAAGATATGA CGACTGGAGT ATATTGCTTT CCGTTCACAT ATATATTGTT CTTTTTTTAT 1440 

TTGATGAATA ACTATTTTAA TAGGTTGGAG TGTCGCATTC GTCTGAAATC AATTAAGCAC 1500 

TTTACCAGTT TTAGTTTCAA ATTAGCAGCT CTTAGTACGG GGATTTGGAC GGCGACTTTA 1560 

TTTTTATTGA TTTTTCTAAT TGCATTTAGT AATGGTTTTA GCTTCTCTTT GGAGATAAAG 162 0 

GAGGTTGATT TTTTAAGAGA ATTTTATGGT ATAAGTATTG CAAACAATGC TAGTTTCTTT 168 0 

ATAGGATTTT TTTTCTCTTA TATAGCATAC TATTTCTTTT TATCCTTACT TACTATTAGC 1740 

AGTTTTTCTT GGTTTAAAAA ATCAAACATG AGCTTAGTAT TTCTGTTTAC TTTTTTATTT 1800 

GTAGAATCCT TATTCTGGAT TTATCAGTTG GACAATGGGA TAATTGGATT ATTGCCAATT 18 60 

TTTCAGTATA TGGTAAATTC CAATCCGTAT GCATTGATTT ATTGGCTTAC ATTACTATCT 192 0 

ATCATAATTC CATTGACTGT ATTTTCTGTT CATAGAAACT GGAGGAGAGT GTAAAAGTTG 19 80 

GAAATGGGAA AGTTAAGTAG TCACATGTGG AGGTTGAATC AG AT AAT C T A TACCAAGTAC 2 040 

TTTTGGGGTT ATGTTCTTTT TTGGATATTG ATTTGTTTAG GATTATGGTA TTGGTTAGAA 2100 

GGAAATGATA GACTTGTTAT AGAAATTTTA AAAGGGCCTA ATCTGAGTCA AAACTCTTTT 2160 

TTAGTCTTAT CTATATGGTT GCTTCATTGG TTTATTATTC ATACATTTTT TCTAGCAGTT 2 220 

GTATATCGTA GAAGAGCATC CGATTTCTTT ATGGAAGTGA TTCGATTTTC TTCTATTAAG 22 80 

CTCTGGATTA GGTATCAGAT TTGGACCTGT TTTCTTTATG GACTCATTTT AATCATGGTA 2 340 

AAAGTTCTAG TGATTCAATT TATGTTACAG TTACCAAACT GGGATATAGG AGTTTTGTTT 2 4 00 

ATAGTTGATT CTTTGAATGC TTGTGTGTTA GTCTTGTTTT GCTTTATGTT ATACGCACTA 24 60 

GGAGCGAATG TACAAATGAA CTTTGCTTGC GTTAGTTTCT TTTTACTCAT GATTGG 2 516 
(2) INFORMATION FOR SEQ ID NO: 249: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1364 base pairs 

(B) TYPE : nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 249: 

CGGTGTTTTT TTGTAAATTT TCTAGCACTT GTATGGTAAA ATAGATACAG GTGTTCATTA 60 

AACTAGACTA AAAACCTATT TAAGCAGGCA AAATGAAGAA ATACCAACAA TTATTTAAGC 120 

AAATCCAAGA AACCATTCAA AACGAGACTT ACGCTGTCGG AGATTTCCTT CCTAGCGAGC 180 



WO 98/18931 



PCT/US97/19588 



1272 

ACGACCTTAT GGAGCAATAT CAAGTGAGTC GTGATACCGT CCGAAAGcCC TGTCTCTCCT 24 0 

CCAAGAGGAA GGATTGATCA AAAAGATAAG AGGGCAAGGT TCTCAAGTCG TCAAAGAAGA 300 

AACCGTCAAT TTCCCTGTAT CCAACCTAAC CAGCTACCAA GAACTAGTTA AAGAACTTGG 360 

ACTGCGCTCT AAAACCAACG TGGTCAGTCT GGACAAGATT ATTATTGATA AAAAATCCTC 42 0 

ACTGATAACC GGTTTCCCAG AGTTTCGGAT GGTTTGGAAG GTGGTCCGCC AGCGTGTGGT 480 

GGATGATCTG GTATCCGTTC TGGATACGGA CTATCTGGAT ATGGAACTCA TCCCAAATCT 540 

CACTCGCCAA ATTGCTGAGC AGTCTATCTA TTCTTATATA GAAAATGGCC TCAAACTCCT 600 

TATTGATTAT GCTCAGAAGG AAATCACCAT TGACCACTCA AGCGACCGAG ACAAGATTCT 660 

CATGGACATT GGCAAAGACC CTTATGTCGT TTCGATTAAA TCAAAAGTCT ATCTCCAAGA 72 0 

CGGACGCCAA TTTCAGTTTA CCGAAAGTCG CCATAAGTTA GAGAAATTTA GATTTGTAGA 780 

TTTTGCAAAA CGCAAGAAAT AAAAGACTGA GACACCAGAT CTCAGCCTTT TTCGGCTCTA 84 0 

TAATATTTGT AGTGGGTAAC CCCCCTATGG ATATTATGGA GCCTATTTTG TGTAGAAAAA 900 

AAGTCCCATA TGACCTATAA TGAAAAGCGA CAAAACAACT CATTAGAAAG ATTCATATGG 960 

AACAATTACA TTTTATCACA AAACTGCTCG ATATTAAAGA CCCAAACATC AAGATTCTAG 102 0 

ACATCATCAA TATGGATACC CACAAAGAAA TTATCGCTAA GCTGGATTAT GAGGCTCCAT 1080 

CTTGCCCTGA TTGTGGAAGT CTAATGAAGA AATATGACTT TCAAAAACCG TCTAAGATCC 1140 

CTTACCTCGA AACAACTGGT ATGCCTACTA GAATTCTCCT TAGAAAGCGT CGTTTCAAGT 1200 

GCTATCATTG TTCTAAAATG ATGGTCGCTG AAACTTCTAT CGTCAAGAAG AATCATCAAA 1260 

TTCCTCGTAT TATCAACCAA AAAATTGCGC AAAAGTTGAT TGAGAAGATT TCTATGACCG 132 0 

ATATTGCTCA TCAGCTGGCC ATTTCAACTT CAACTGTCAT TCGG 13 64 
(2) INFORMATION FOR SEQ ID NO : 2 50: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH : 1227 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 250: 

CCATGAAGAC CGCTTGGAAT TGGAATGGCA CAAGTCTTTG TTGAATGGTC TATTCCCATT 60 

GACAATCGGT GGAGGAATTG GACAATCTCG TATGGCCATG TTCCTACTTC GCAAGAGACA 120 

CATCGGAGAA GTGCAAACAA GTGTTTGGCC TCAAGAAGTC CGCGATACTT ACGAAAATAT 180 

TTTGTAGAGA ATCGAACCGC AAGGTTCGGT TTTCTTTCTC TTTTTGTCTA TAATTTGGTA 240 
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TAATAAACAG TATGAAAATC GTATCAGGAA TCTATGGGGG ACGTCCCCTC AAGACACTAG 3 00 

AAGGCAAGAC GACAAGACCT ACTTCGGATA AGGTTAGGGG AGCCATTTTT AACATGATTG 3 60 

GTCCCTACTT TGAAGTGGGA CGAGTCTTGG ACCTTTATGC AGGTAGTGGT GGTTTATCTA 42 0 

TCGAAGCAGT ATCGCGTGGC ATGTCCAGTG CTGTTTTGGT GGAGCGAGAC CGTAAGcTCA 480 

GACCATCGTG GCTGAAAATA TCCAGATGAC CAAGGAAGTT GGAAAATTTC AACTCCTCAA 54 0 

GATGGATGCA GAAAGGGCAT TGGAACAGGT ATCTGGGGAA TTTGACCTCG TTTTCTTAGA 600 

CCCTCCCTAT GCCAAGGAAC AAATCGTAGC AGATATTGAA AAAATGGCTG AGAGAGAGCT 660 

TTTTTCTGAA GATGTTATGG TTGTGTGCGA GACGGATAAA GCCGTTGAAC TTCCAGAAGA 72 0 

AATTGCCTGT CTGGGTATCT GGAAGGAAAA GATTTATGGA ATTAGTAAGG TGACAGTCTA 7 80 

TGTCAGATAA GATTGGCTTA TTCACAGGCT CATTTGATCC GATGACAAAT GGGCATCTGG 84 0 

ATATCATTGA ACGGGCGAGC AGACTTTTTG ATAAGCTTTA TGTGGGTATT TTTTTTAATC 9 00 

CCCACAAACA AGGATTTCTC CCTCTTGAAA ATCGTAAACG GGGGTTAGAA AAGGCTGTGA 9 60 

AACATTTGGG AAATGTTAAA GTCGTGTCTT CTCATGATAA ATTGGTGGTC GATGTCGCAA 102 0 

AAAGACTGGG GGCTACTTGC CTAGTGCGAG GTTTGAGAAA TGCGTCGGAT TTGCAATATG 10 80 

AAGCCAGTTT TGATTACTAC AATCATCAGC TGTCTTCTGA TATAGAGACT ATTTATTTAC 1140 

ATAGTCGACC TGAACATCTC TATATCAGTT CATCAGGCGT TAGAGAGCTT TTGAAGTTTG 12 00 

GTCAGGATAT TGCCTGCTAT GTTCCCG 122 7 
(2) INFORMATION FOR SEQ ID NO: 251: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 3652 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 251: 

CCGGTCAAGT T AAAAACG C T ATTTCTTCCC ATTTTATTTA TTTTTTAGGA GTGGTAACGT 60 

ATCAAAATAG CCCAAGCGTT CTCACCCGTG TGAGTTTGAA TAATGGAACC CGTTTCCAAA 12 0 

ACAGAAATTG GCTTTTCAAC ATAAGCTTGT AAGCTTTCTT TCATCTCTTT TGCCCAATCA 18 0 

TCACTACCAG AATATGAAAT TCCAATCTCT GCTACAGCAC GTTCAGAAAG CGATGTTATC 24 0 

AACTCATCTA ACCATTTTTT AAATGTTTTA GTTCCACGAC CTTTAACCAT TGGCTGCAAT 300 

TCATGGTCTT TCATTTGCAT GACAGCACGG ATATTGAGAA GAGAGCTCAA CAAGCCAGTT 3 60 
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ACACGGCTAA TTCGTCCACC TTTGACAAGA TTTTCCAAAG TTGAAACACC AATATAAAGC 42 0 

TCTGTATGGT TTTTAACCTC TTCTACATGA GATAAAATTG CCTCCATATC TTTACCTTCT 480 

TGAGCTAACT TCGCAGCCTC AACAACTTGG AATTTCAGGG CTTGGTCAGT GAAGGAACTA 540 

TCAACAACAG TCACATCTGC AGTAGATAGG CTAGCACCTT GGCGTGCTGC TTCTACCGTA 600 

CCCGAAAGAG CATGGGACAT ATGAATAGCA AGAATCTGGC CACCATCTTT GCATAGGTCT 6 60 

TCAAAAATCT CAGCAAAGAC ACCTACAGGT GGCTGACTTG TTTTCGGAAG ATTCTTACTT 72 0 

TCTTGCATCA AC T G AAG AAA TTTACCTTCT TCTTTCAAAT CCGCATCAGA ATAAACAACA 7 80 

TTATCAATCA TTACAGATAA TGGAACAATT GTAATATCTA ATTGCTTTAC TAGTTCAGGT 840 

TCAATAGTAA CAGATGAATC GGTTACAATC TTAATTTTTG TCATAGTATC AATCTTTCTA 900 

TTTTAGGATT CAGATTGGTT TCCTTACTTC TAATTATATC AAAAAAAAGA TTAAAAATCC 960 

TAATGGAGTC AATCAAATTT TCCGTAAAAT TTGATATAAT CAACTTATAA GAAAAGAGGT 102 0 

GTCCTATGAT TAAAAAAATT TACCCCATTT TTACCATTTT ACTAGGTGCT GCTATTTATG 108 0 

CTTTTGGACT GACTTATTTT GTAGTTCCCC ATCATCTCTT TGAAGGAGGG GCGACAGGCA 114 0 

TTACCCTCAT CACCTTTTAT CTTTTTAAAA TCCCTGTTTC CCTCATGAAC CTGCTGATTA 1200 

ATATTCCCCT TTTCATCCTA GCTTGGAAGA TTTTTGGAGC CAAATCCCTC TATTCTAGTT 1260 

TACTAGGAAC CTTAGCTTTG TCCGGCTGGT TAGCTTTTTT TGAGCATATT CCCCTTCATA 132 0 

TTGATCTTCA AGGTGATTTA CTAATCACAG CCCTTATAGC GGGAATCCTA TTGGGAATTG 1380 

GCCTTGGAAT TATTTTTAAT GCTGGAGGTA CAACTGGCGG AACTGATATT CTAGCTCGTA 144 0 

TTCTCAACAA ATACACTCAT ATATCCATAG GAAAACTGCT CTTTATCTTA GATTTTTGTA 1500 

TTCTCATGTT GATTCTCCTA ATCTTCAAGG ATTTGAGATT GGTTTCCTAC ACGCTTTTGT 1560 

TTGATTTTAT TGTTTCTCGT GTTATTGATT TGATTGGTGA AGGAGGATAT GCCGGCAAAG 162 0 

GCTTTATGAT TATCACAAAA CGTCCTGACC AACTTGCTAA GGCGATTAAT GATGACCTCG 168 0 

GAAGAGGTGT TACTTTTATT TCTGGTCAAG GCTACTATAG TAAAGAAAAT TTGAAAATCA 1740 

TCTACTGTAT TGTCGGAAGA AATGAAATTG TGAAAACGAA GGAAATGATT CATCGAATCG 1800 

ATCCTCAAGC CTTTATAACT ATTACAGAAG CCCATGAAAT CCTAGGAGAA GGCTTCACCT 1860 

TTGAAAAAGA ATAAAAAGAG GTAATGTCGT GACCTCAAAA GTTAGACTAA ATCATCTATC 192 0 

TTTTGGGTTA CAGACAACCT CTTTTTTATT TTATTTACTC AAGCTCTTAA GACCAATTCC 1980 

GAGTTACTTC TTCATCAGCC TTTAACTGAT CCACTAATTG GTCAACTGAG TCAAATTTGG 2 040 

TCATATCTCG AATGCGATCA AGCCAATAAA CCATGACGGT TTCCCCATAA ATATCTTGAT 210 0 

TAAAATCAAA AATATTGACT TCAAAACGTG CTTCTTCTCC ATCAAAGGTC ACATTTTTCC 2160 
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CGACACTAGC CATAGCACGA TACTTCTGTC TTTGAATCTC AACATCAACA ACATAAACGC 22 2 0 

CATCTGCTGG CATATAAGTA CGGTCTAAAA GCACTAAATT CGCTGTCGGA TAACCAATTG 22 8 0 

TACGACCACG AGCATTACCA TGAACCACCA TACCTCTTGA TGGAAGCGGT GCCCCCAAAA 2 3 40 

GTTTTCCTGC TTCTTTCACA TTTCCATCTA AAATAGCTTG ACGGATACGA GTTGAACTAA 2400 

TCTTTCCTTT CTCATCTTCT ACAGGTGGAA CAATGATAAC TTCTCCATCA AAGTAATTCT 24 6 0 

TTAAATCTTC TGCTGTTTTT TTGTCAGAAC CAAATGTATA ATCAAAACCT GCAACAATAA 252 0 

TTTTGGCATT CATAGCCTTG ATATAAGTTG CAAAGAATTC TTGTGCAGTG AGACTAGCGA 2 580 

ATTGACTACT AAAATCAAGG AGATATAATT CTTCTACACC TTCGCGCTTT AATTTTCTTT 2 64 0 

CACGTTCAGC AGGGTTCAAA ATATGCAAAA ACAAATCTGG ATGATAAGGC TCTAAAGCGA 2700 

TCTTTGGAGA TTCATTAAAG GTCATAACGA CGATAGGCAA CAAATCCTTT CTCGCAGCCT 27 60 

TGTTGGCAAC ACGAAATAAT TCTTGATGCC CCTTATGTAT GCCATCAAAA TAGCCGAGAA 2 82 0 

CAACGACTGA ATCAGATGGT GTGCCAATAT CTTTTTGGTT TTTTATACGA ATAGTAATAA 2880 

TCATAAAATA ATTATATCAT AGCGATAGCT ATTTCTGGAA CAGAAAATCT GAAATGTTGT 2 94 0 

TTTTTTCACA TGAAGTGTAC CTGTTTTCAA AAAGCACTTT ATTCTATCGT TGCTTAACTA 3000 

TGAACTTTGC AATATTCTTC TCAAAAACTT GTAGGACATC TTCAAAATTT TGCAAGGAGT 3060 

GATTAGACTT GTTCGGTAAC CATAAAGTGT CATACTATGC TTATGTATGA AAAAGCAATG 312 0 

CAACTAACTC CTGAGAACTT TAAATTACTA ATTGGTGCCG AAAAGGTAGA ATTTAGAATC 3180 

GAGGTACACC TATGGCTGTA AAATTTACAA AATGAGACAA CTTGGGCAAG ATGTTTGAAG 3 240 

AATTTCCTAA ACTCCCTGAT TTGAAGCAAG TCACTTTCCC TAATGACAAA GAAAAAAGCC 3 3 00 

AAAACAGCAA AGAAAAACTA GATGACTGCT TTCCAACAAC TCCCATCTAG TGTGCTTCAG 3360 

ACTGGGCTAT TTTTCTCTCC ATCTGTTAGC TTGGATTCTC AGACCGTTTC AGCTAAAGAA 3420 

TATCTTTTCC CTTATCAGAA GGAACGGCTC AAGCCATTCA GACAAGTGAA GGGACGACAA 3 480 

GCCAATATTT GAAACCAGAT AGCAGTTCTT ATAGTCAATT GAAATAAAAT CTGAAGAAAT 3 540 

CGAGTAGGAA ACTCATATCA ATGTTTAACA GTGTTCTATT CCAGATTCAT ACTCAATGAw 3 500 

AATTAAAGTG CAAACTAGGA AGTTAGCCGC AGGTGATACT TTGGGTACGG CA 3 552 
(2) INFORMATION FOR SEQ ID NO: 252: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 743 base pairs 

(B) TYPE : nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY : linear 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 252: 

GTACCGTGGT GCCAAAGTAC AGCAAGGTTG GCTTTTTGAC AAACAATACC AATCTTGGTT 6 0 

TTACATCAAA GAAAATGGAA ACTATGCTGA TAAAGAATGG ATTTTCGAGA ATGGTCACTA 12 0 

TTATTATCTA AAATCCGGTG GCTACATGGC AGCCAATGAA TGGATTTGGG ATAAGGAATC 180 

TTGGTTTTAT CTCAAATTTG ATGGGAAAAT GGCTGAAAAA GAATGGGTCT ACGATTCTCA 24 0 

TAGTCAAGCT TGGTACTACT TCAAATCCGG TGGTTACATG ACAGCCAATG AATGGATTTG 3 00 

GGATAAGGAA TCTTGGTTTT ATCTCAAATC TGATGGGAAA ATAGCTGAAA AAGAATGGGT 3 60 

CTACGATTCT CATAGTCAAG CTTGGTACTA CTTCAAATCC GGTGGTTACA TGACAGCCAA 42 0 

TGAATGGATT TGGGATAAGG AATCTTGGTT TTACCTCAAA TCTGATGGGA AAATAGCTGA 480 

AAAAGAATGG GTCTACGATT CTCATAGTCA AGCTTGGTAC TACTTCAAAT CTGGTGGCTA 540 

CATGGCGAAA AATGAGACAG TAGATGGTTA TCAGCTTGGA AGCGATGGTA AATGGCTTGG 600 

AGGAAAAACT ACAAATGAAA ATGCTGCTTA CTATCAAGTA GTGCCTGTTA CAGCCAATGT 66 0 

TTATGATTCA GATGGTGAAA AGCTTTCCTA TATATCGCAA AGTAGTGTCG TATGGCTAGA 720 

TAAGGATAGA AAAAGTGATG ACA 743 
(2) INFORMATION FOR SEQ ID NO: 253: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 4010 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO; 253: 

TTTTGGTTGA TGATACGAGG GATTTGGTGA TTCTTCTTGA CGATAGAAGT TTCAGCGACC 60 

ATCATTTTTG AACAGTGATA GCACTTGAAT CGACGCTTTC TAAGGAGAAT TCTAGTAGGC 120 

ATACCAGTCG TTTCAAGATA AGGAATTTTA GAAGGTTTTT GAAAGTCATA TTTCTTCAAT 180 

TGGTTTCCGC ACTCAGGGCA AGATGGGGCG TCGTAGTCCA GTTTGGCGAT GATTTCCTTG 240 

TGTGTATCCT TATTGATGAT GTCTAAAATC TGGATATTAG GGTCTTTAAT GTCTAGTAAT 300 

TTTGTGATAA AATGTAATTG TTCCATATGA TTCTTTCTAA TGAGTTGTTT TGTCGCTTTT 360 

CATTATAGGT CATATGGGAC TTTTTTTCTA CAATAAAATA GGCTCCATAA TATCTATAGT 420 

GGATTTACCC ACTACAAATA TTATAGAACC GAATTAATTT AATTAGAGAG CCAACTTTCT 480 

AATATAGTAA TCGCGTCATA ACAAGGTATC TATCATTCAT GGAGTTCCTC CTGTATACTA 540 



WO 98/18931 



PCT/US97/19588 



1277 

TTAGTAAAGT AAAACTATTG GAGGATATTT TAATGCCACA ACCTATTGTT CCTGTAGAGA 600 

TTCCACAATC TCGTCGTTTT GATTCTAAAA AGAGAAATGA TATTCTGCTT AAAATTCGTA 660 

TTGGCAAGCT TGAAGTAAGT TTTTTTCAAT CTCTCAATCT CGAAATGGTA GAACAGCTTT 72 0 

TGGATAAAGT GTTGCTCTAT GACAATTCAT CTATCTAGCC TAGGGCAGGT CTATCTCGTA 7 80 

TGTGGGAAAA CGGATATGAG GCAAGGCATT GATTCATTGG CTTATCTGGT TAAAACCCAC 840 

TTTGAATTAG ATCCTTTCTC CGGTCAAGTT TTTCTCTTTT GTGGTGGACG TAAAGACCGC 900 

TTTAAAGCCC TTTACTGGGA TGGTCAAGGA TTTTGGCTAC TATATAAACG CTTTGAGAAC 960 

GGAAAACTGA CTTGGCCCAG TACAGAAAAG GATGTCAAAG CTCTCACACC TGAACAAGTA 1020 

GATTGGCTTA TGAAGGGCTT TTCTATCACT CCAAAAATAA ATTTATCAGA AAGTCGTGAT 108 0 

TTCTATTGAA ATGAGGACTT TCTTTTTAGT TATAATAAAG TTAGGAAATA AGGAGAGGAA 114 0 

GCCCATGGAA GAAGATTGAA AATCATTCAA CAACAGAGTG CTACAATTGA TAGTCTCACC 12 00 

AATGAACTTG CCCTTCTTCG TGAACAAGTG GCTTATCTAA CGCAAAAGCT CTATGGAAAA 12 6 0 

TCCTCTGAGA AAAGTGTTTG CCCATCTGGA CAACTCAGTC TTTTTGAAGA GGAACAAAAT 13 2 0 

ATGGAAGAAG ACTCTGACTT ACCCAGTTGA AAGAGAAGAA ATCACCTATA AACCTAAGAA 13 80 

AGCTAAAGGG AAACGTCAAG CTCTTCTTGC CCAATTTGAT TCAGAAGAAG TTCATCATCA 144 0 

AGTAGAAGAG AGCATTTGCC CTGATTGTCA GGGAGATCTA AAAGAGATTG GAGCAACCCT 1500 

TCAACGACAA GAATTAGTCT TTATTCCTGC GCAATTAAAA CGAATAGATC ATATCCAACA 1560 

CGCTTATAAG TGCCAAGCAT GCAGTGATAA AAATCCGAGT GATAAAATCG TGAAAGCTCC 162 0 

TATTCCTAAA GCCCCTTTGG CGCATAGCCT TGGCTCAGCT TCTATTATCG CTCACACCAT 168 0 

CCATCAGAAG TTTAATCTGA AGGTACCCAA TTATCGCCAA GAAGAAGATT GGGCTAAGAT 1740 

GGGTTTACCA ATCACACGTA AGGAAATTGC TAATTGGCAT ATCAAGGCGA GTCAATACTA 18 0 0 

TTTGGAGCCC CTTTATAATC TTTTACGAGA AAAGTTGTTA GAACAAGCTC TTCTTCATGC 18 60 

GGATGAAACC TCTTATCGGG TTCTAGAGAG TGATAGTCAG TTGCCTTACT ATTGGACTTT 1920 

TTTGTCTGGG AAAGCTGAGA ATCAAGCAAT CACGCTGTAC CACCATGATC AGCGTCGGAG 1980 

TGGTTTAGTA GTACAAGAAT TCCTAGGAGA TTATTCTGGC TATGTTCATT GTGACATGTT 2040 

GCGGCAGTAA CTTAGGACTT TAGTCCTCTA GTTCTGCCTA TGCGATAGCA GTCCAAGGTT 210 0 

TAGGAGTAAG GCGACGCTAA GCTTGGTAAA CTGCGAACAG CTAGAAGCTT ATCGTCAACT 2160 

GGAAGAAGCT GCACTTGTTG GATGTTGGGC GCATGTGAGA AGGAAGTTTT TTGAAGTGC'C 222 0 

CCCCAAGCAA GCAGATAAAT CATCCTTAGG AGCTAAAGGT TTAGCTTATT GTGATCAGTT 2280 
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ATTTTCCTTG GAAAGAGACT GGGAGGCTTT GCCAGCTGAT GAACGACTAC AGAAACGTCA 2340 

AG AAC AT C T C CAGCCCCTAA TGGAAGACTT CTTTGCTTGG TGCCGCCGTC AGTCAGTTTT 2400 

AGCAGGTTCA AAACTAGGAA GGGCAATTGA ATACAGCCTC AAGTATGAAG AAACCTTTAA 2460 

GACTATTTTG AAAGACGGAC ATCTGGTCCT TTCCAATAAT CTAGCTGAAC GCGCCATTAA 252 0 

ATCATTGGTT ATGGGACGGA GTAAAAGAGT CCAGTGGACT CTTTTAGCCT GAGCTCAGTT 2580 

TAAAAAAGCG AGGGTGGTTA TTTTCTCAAA GTTTTGAAGG AGCTAAAGCA AGAGCTATTG 2 64 0 

TTATGAGCTT GTTGGAAACA GCTAAACGTC ATCAATTATA GTGCGTTGAA TCTATAACAG 2 700 

TACGCATCGA CTGCTAAAAC ATTTCTATAA ATCAATTTTC CTTTCCTAAT CGATTTGTTC 2 7 60 

ATATCTTATT TCAATCCATT ATAAATAGCG AGAAATATCT ATCCTATCTT CTAGAATGTC 2 820 

TTCCAAACGA GGAAACTCTC GTAAACAAAG AGGTTTTAGA GGCCTATTTA CCGTGGACTA 2 880 

AAGTTGTACA AGAAAAGTGC AAATAAGAAA TCTCCAGATT AGGAACTATC CGTGAGTTCT 2940 

CTAGTCTGGA GATTTTTCAA TAGACTTCGT TATTGGACGG TTACAATTTA TTATATGAAA 3 000 

ATCCCATATT ATTCTCCAAT TCTATATTTT ACCTTTCTAA ATGTATAGAT TAACTACCTA 3 060 

ATTATAGCAT ATAACGCAGA TTCCTTTCAA TCGTATGATT TACTGCATTA AATTAAGTAA 3120 

AAAAATAAAG GCAGTCCGAA GACTGCCGAT ATTTATCTCT CATCTCTTTA ATTATGGTAA 3180 

GTAAATAAAT AATTTCCCTA AAGATATGGA AATTATTAAT ACTATAAATA CATATTATAA 3240 

AGTTTATAAA TACTGTAAAA ATCCTGAAGT TAATTTTCTA ATAAATATCA ATATGTGTTA 3 3 00 

GTATCTTTTA AATTTTTAGA CAATTTACTA GTTCTATAGA CATGTTTAAC AGACTCTATT 3360 

TTACAATTCA AAAATTTCAT CTGCCACTTC ATTTAAAAAT TCTATATCAT GGGAAACAAT 3420 

AAAAATTATT TTATCCATGG TTTTATACTT ATTAATCAGT TCAGATATTT TTATCATATT 3 480 

GGAATAATCC ATACCACTTG AAGGTTCGTC AAAAAAGACA AATGGAGAAT TCTTGCACAT 3 540 

AACAGATGCT ATTGCAAGCC TTTGCTTTTG CCCTCCTGAT AAACTCATCG GATGCCTTTC 3 600 

AATAAATTCG TCCAGGCATA AATCTTTTAA CCCAAATCAT TCATACCTCT CTCAACTAGA 3 660 

TGTAACTTAC AAAACCCCTG ACCTCATGAG CCACTTTCTT CCTCCTCATG AGGTCAGTTT 3 7 20 

TACTTTCTGC TGTTCCAGTA TCGTTTTTCC TCGCTAGATT TCCTCAAAAG GGCAGACTCC 3 7 80 

TCCCTTGGTT CGTCACACGA TTTTTTCATC TCGACTGTTC TTTAATGCAT CATTAACGAC 3 840 

GCTTTTCTTC TAGGTGGTTC ATAAGGAACA GGAAGATTCA GGTTGACTTT TCTAATCCTA 3 9 00 

GAATAAAGTG CTGAAAACAA TTCGGAATAG GCATAGAGAC TAGACAATTT GAGGAGCTGC 3 960 

TTGCGTCCTG TTCGAACACA TTTTCCCACC ACGTGAAGAA AAAGATGGCG 4010 
(2) INFORMATION FOR SEQ ID NO: 254: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 2789 base pairs 
<B> TYPE : nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 254: 
ATGCATCCGT TTGTCAAGCC TAAATTGTAA TTTTTTTCAA TTTAAAACAG AAAAACCCAG 60 
GAAAATGACA TAAAAATATC ATTCCTAGGC CTATTTATGC TATTTCTCTC TGAAAAATAT 120 
GAGTATTCAG TCGGTCAAAT GAAGCTGAAC GAACTCATTT TCCCTCGCCT AATTCAATGA 180 
TTCGATGACA TTGTTGGGCT ACATAAGCAT CGTGGGTCAC GATAATGACT GTTTTCCCCT 240 
CTCGATTCAT CTCTAAGAGA AACTTCAAGA CCAAATCTCT ATTTTCAGGA TCCAGAGAAC 3 00 

CTGTCGGTTC ATCGGCTAAA ATCAGCTGGC TGGGTTTTAA GATGGCTCTA GCAACTGCAA 3 60 

TTCGTTGTTG TTCGCCCCCA GACAACTCGG AGACCCTTTG ATGCAAAGTA GCTGATAAAC 420 

CTACTCTCTC TAAAATCTCT TCCACCTTTT TGAGCTTGTC TTTCTTAGGC AATTTCACAT 4 80 

ATTTCAGCGC CACATGAGAT TGTACTCGAC CGTTTCATCA TCAATCAGGG CAAAATTTTG 540 

AAACAGATAA GAGATATGTT CACGGATTAT TGTTTGCGAC TTAGCACAAT TAACCGCTAG 600 

ATTTGTCTGA CCAAAAATCT CATACCGTCC GCTATAATCA CCATCTATCA AACCCAATAA 6 60 

ATTTAACAAG GTCGACTTCC CACTACCACT CTTACCAACA ATAGCTACCA AATCCCCCTG 7 20 

ATCAATCCTG AGAGATAAGT TATCCAAAAT CACTTTTCCC CCAATGGTTT TGGTAATATT 7 80 

TTTCAACTCA ATCATAAGAT GCCCCCTTTC AATAACTCTA CTAGACTTCT TTTCTCCATC 840 

CTAGAAGCTA AGCCTAGCAC AAATAGTATA TCCAGACATG TAAAACCTGC AAACAGTAGA 900 

AGTGGTAAGA ACGCATGGGC AAAGAAAATC AAGACTAGAA GAGGGAAACT ATAGCCCAGC 9 50 

AAGAGCAGAA CGAGGAGAGG ACGGTAGCGA TCGACCAGTT TCCACCCCAT AAACTTCTTG 102 0 

GTAATGATAT CCCTGCGCTT CAATAAGAAA GTTGTTACTA GTAAGAAGTA GGAAATCATC 1080 

ATGCTAAGGA GACCAAACAA AGCAAAGAGT AGGTTAAAAT TCCGAACAGC ATCTCGATAA 114 0 

GAATCCACTT TCTCTTGTTG AATGGCTTGA ATAGATGAAA ATTTTAAATA ATTTCCATCT 12 00 

GACAATTTCT CAACTAACTC TGTAATCTCT TTTTGATGTT GAACCGTATT TTCAATTTTA 12 6 0 

ATCGGATTAT TTAAGCCAGT TGTTGACAGG GAGGCTTTCT CATCCCACAT CATATCAGAA 13 2 0 

TCATTGACCA AGCTAATAAT TGGATTGGAG AGATTTTCCT TTCGCTTATC ACTATATGGG 13 80 

AAAAATGACC AATCTCCTTC ATAATAGGCA ATCTCGACAT CCATCTCCTC TATCGTTCGT 144 0 
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TTTTGCTGCT CTTCATACTT CATCGAATGA AAGGCAATTA ACTTCCCCAA GAGCTGATTT 1500 

TTATCTTCTT CACCTTTCGT ACTTGCTGGC ATCAAAATAA CTTTTTTAAT ACCGGTATTT 1560 

GGTAGCTTGA ATCCCTTGCT CTTTAGAAAA TTGCGATTGG CATAGTAAAC ATCCACCGTA 1620 

TCTGTTAACT GATATTGCTG AATCTGTTCT GATTGGACAA AATTTTTTAC AGGAAGACTG 1680 

CTACTCTGCA CATAGCCCGC CTGCGTTTTT TCTACCAAAT CCTGATAAAA TCGATAGAAA 1740 

TAATCTGTAG ATTTCCCTGA CCCTGCTAGC TCTTCTTGCC ACAGATTATC ATTGAGTTTG 1800 

AAGGTTTCTA AGGTCAGGTA ATTACCTTGA CTTACCCACT GTTGCTGATA AGCAAGTTCT 1860 

TTGTTTTCTT GTTCTAAACT TCTGCCCACC CCAATCAGTA AGGCCGTCAG TAAAATAGTT 192 0 

GTCCCTATTT TCATCACATA ATTGAAGATA AGACCAAATT TGAAAGATGA AAAACCTTTC 1980 

AGCAGAGAGC TGATTGTCAT TTTTTGGATT AAGAGGTAAG TCAACCAACT GATAAAGAGA 2 040 

TAAAGCTGCA AC AG C AAAAA ATGAGACAAC CACAGCATAG GAAACAAATC TTTTGGCTTA 2100 

TAATCAAGCA AGAAAAACAC GCCTAGATTG ATCACAAGAG CCCCACCTAG GAGGAGGTAA 2160 

AGGTTGCCTT TTACAACATC AGCTAAAACA GCCCTATCTT GAAAACCAAG TAATTTTTGT 2220 

ACCCCAACTC TTTTCATCTC CATCATCGGT TGATACACTG TCACTAACAC AAGAAGCAAA 2 2 80 

ATAGCCAAGA CAAAAACAAT GGCAGATAAA AGCAAATCTC CATTTATGAC TTCCACTGCA 2 340 

CTTTTGTAGG TCGGCTCTAG CAAGGTAGCC TGGTCTATCT TGAAAAAATC GCTCCATTTC 24 00 

TGTACAATCC TATCCTTGTC CATCTCTTGT GTAGAAGTTA TCGTATAGCG ACCATTTAAA 24 50 

CTACGAGATG TATCCTTGAT ATAGGTTTGA AAAGTCATAA GCTGAATAGG TTTGGCTTTT 2520 

AGAAAGGTCG GAATCGTACC AAGTTTATTG GAAATTTCTT TATTACTATA GACTCCTTCA 2 5 80 

CCATCTGTGG TAAAATCAAG AGAAGAAATC CCAAACTCTT GGTAGGGGAA GGTATCTTTA 2 64 0 

TCAAAAACAC CAGACTTGAC CACCTCATCA CCACTGTCTG TTTTGATGAT GGAGACTTTA 27 00 

TACTCCTTTG ATACATCCTC AAAAAATCGA AGAACAGACG CTGCAGGTTC GTTAATATCT 2 7 60 

TTCAAATACA AATCCAAAGA ATCTACAGG 2 7 89 
(2) INFORMATION FOR SEQ ID NO: 255: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2495 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 255: 

CTGCGAATTT TATTAAAGAT AATGTGTTAA TTACAGCGGC TCACAACTAC TACAGACATG 6 0 
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ACTATGGGAA AGAAGCGGAT GATATTTATG TTCTTCCGGC TGTTAGTCCA AGTCAAGAAC 12 0 

CATTTGGAAA GATCAAAGTA AAGGAAGTTC GTTATTTGAA GGAATTTAGA AATTTAAATT 180 

CTAAGGATGC AAGGGAATAT GACTTGGCTT TATTAATTCT AGAAGAGCCC ATTGGTGCAA 240 

AATTAGGGAC TTTGGGTCTT CCTACTAGTC AAAAAAATTT GACAGGAATA ACTGTGACTA 3 00 

TCACAGGCTA TCCATCATAT AATTTTAAAA TTCATCAAAT GTATACAGAT AAGAAACAAG 3 60 

TTTTAAGTGA TGATGGCATG TTCTTGGATT ACCAAGTTGA TACTTTAGAG GGGTCTAGTG 42 0 

GATCTACAGT TTATGATGCT AGTCACCGTG TAGTAGGAGT GCATACTTTA GGAGATGGAG 480 

CTAATCAAAT TAACAGTGCA GTTAAATTAA ATGAACGAAA TTTGCCATTT ATTTAwTCGG 54 0 

TTCTTAAAGG TTACTCTCTT GAAGGATGGA AGAAAATAAA TGGTAGTTGG TACCATTATA 600 

GACAACATGA TAAACAAACG GGTTGGCAGG AGATAAATGA TACCTGGTAT TATTTAGACA 6 60 

GTTCCGGTAA GATGCTTACA GATTGGCAAA AAGTCCATGG AAAATGGTAT TATCTCAATT 72 0 

CAAATGGAGC AATGGTTACA GGTAGCCAAA CTATCGATGG TAAAGTTTAT AACTTCGCTT 78 0 

CATCTGGTGA GTGGATTTAA TGTTGGAGGA TATATAAAAT GAAGCTTTTG AAAAAAATGA 84 0 

TGCAAATCGC ACTAGCCACA TTTTTCTTCG GTTTGTTAGC GACAAATACA GTATTTGCAG 900 

ATGATTCTGA AGGATGGCAG TTTGTCCAAG AAAATGGTAG AACCTACTAC AAAAAGGGGG 960 

ATCTAAAAGA AACCTACTGG AGAGTGATAG ATGGGAAGTA CTATTATTTT GATCCTTTAT 1020 

CCGGAGAGAT GGTTGTCGGC TGGCAATATA TACCTGCTCC ACACAAGGGG GTTACGATTG 1080 

GTCCTTCTCC AAGAATAGAG ATTGCTCTTA GACCAGATTG GTTTTATTTT GGTCAAGATG 1140 

GTGTATTACA AGAATTTGTT GGCAAGCAAG TTTTAGAAGC AAAAACTGCT ACGAATACCA 1200 

ACAAACATCA TGGGGAAGAA TATGATAGCC AAGCAGAGAA ACGAGTCTAT TATTTTGAAG 12 60 

ATCAGCGTAG TTATCATACT TTAAAAACTG GTTGGATTTA TGAAGAGGGT CATTGGTATT 132 0 

ATTTACAGAA GGATGGTGGC TTTGATTCGC GCATCAACAG ATTGACGGTT GGAGAGCTAG 138 0 

CACGTGGTTG GGTTAAGGAT TACCCTCTTA CGTATGATGA AGAGAAGCTA AAAGCAGCTC 144 0 

CATGGTACTA TCTAAATCCA GCAACTGGCA TTATGCAAAC AGGTTGGCAA TATCTAGGTA 1500 

ATAGATGGTA CTACCTCCAT TCGTCAGGAG CTATGGCAAC TGGCTGGTAT AAGGAAGGCT 1560 

CAACTTGGTA CTATCTAGAT GCTGAAAATG GTGATATGAG AACTGGCTGG CAAAACCTTG 1620 

GGAACAAATG GTACTATCTC CGTTCATCAG GAGCTATGGC AACTGGTTGG TATCAGGAAA 1680 

GTTCGACTTG GTACTATCTA AATGCAAGTA ATGGAGATAT GAAAACAGGC TGGTTCCAAG 1740 

TCAATGGTAA CTGGTACTAT GCCTATGATT CAGGTGCTTT AGCTGTTAAT ACCACAGTAG 1800 



WO 98/18931 



PCTYUS97/19588 



1282 

GTGGTTACTA CTTAAACTAT AATGGTGAAT GGGTTAAGTA ATGAAGGCTA ATTGTAAACT 1860 

GTGATGGATA CTTAACTTTG TATAATAGGT GGATAAAAGT CTTCACAATC AAAAAACGCA 1920 

TAGTATCAAG GTTTTTCTGT ACTGCCCTCA AACAGTTAGA CAATTAATTT ATCCGAAGgA 1980 

TTTAGTTCTG TATTGCACAG GGCTAAGTCC TTTTAGTTTT ACCTTAATTC GTTTATTGTT 2 040 

GTAGTAATCA ATATAGTCTA TAATGGCTTG TTCCAATTGC TTAAGCGACT GAAACGACTT 2100 

CTCATAACCG TAAAACATTT CCGATTTCAG AATCCCAAAG AAGGACTCCA TCATACTATT 2160 

GTCTGGGCTG TTTCCCTTAC GTGACATGGA TGCTTGAATT CCCTTACTCT CTAGGAACCG 2220 

ATGATAAGAA TCGTGTTGGT ATTGCCAGCC TTGGTCACTA TGGAGAATCG TATTCTCGTA 2 2 80 

GTGCTTCTCT GTGAATGCCT GTTCCAACAT TGTTTGTACT TGTTCTAAGT TGGGTGAAGT 2 340 

TGAAAGATTA TAGGCGATAA TTTCGCTATT AAAGCCATCT AAAACTGGTG ATAAGTAAAG 24 00 

CTTTTGAGTA CTTGCTGGAA TGGCAAATTC TGTCACATCT GTGTAGCACT TTTCCATTGT 2 4 60 

TTTAGAGCCT TCAAATTGGC CTTGAATGAG ATTCG 2 4 95 
(2) INFORMATION FOR SEQ ID NO : 256: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 870 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY : linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 256: 

TACCACCGTA TTCATCCAGC AAGATTGCCA TTTGTCTTTG GGTATTTCGC AGTTCTTTTA 50 

GCAAGTCATC CACAAAAATA GTTTCAGGTA CAAAAAGTGG ATCTTGTAAA ATTCTCTTCC 120 

AAACAATATT GTCAAAACCG TCCACAAAGC CTGCCTTAAG GAGACTCTTG GTGTGAATGA 180 

TTCCAATTAC ATTGTCCTTA TCCCCATCAT AAACCGGGAT ACGAGAATAA TTTTGTTTTA 240 

AAATACTTTG GATAATGGCT TGACTATCAT CCTGAATATC CACCATAAAG GCATCCGTTC 3 00 

GAGGAACCAT AACCTCTCGT GCCATCAGTT CATCGAGCGA AAAGACACCT TGTAGCATCT 3 60 

CAATCTCATC AGCATCCAAT GTTTCTTCAC TATTTGTCAG CATATAGGCA ATTTCATCAC 420 

GGGTCATCTT TTCATCCGCA TCATCGAATG ACATAGGAGT CAAATGGCTC AAGAAATTGG 480 

TCGAAGCAGC TAAAAGCCAA ACAAAAGGAC TGACTAGTTT TCCGATCCCA ATGATAATCG 540 

GCGCTGTACG AATTGCCAAG GCATCCTTTA GATTAAGAGC GATTCTCTTA GGATATAATT 600 

CCCCAAAAAC GATGGAAATA TAGGTCAAAA ATGCCAAGGA TAGAAAAGTT GCCACGGCTT 6 60 

GTGCTGTTTC GCCATTCCCA AGCCAAGAGG CAATCACACG TCCTAGAGTA TCAGTTAAAC 72 0 
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TCGCCCCTGA TAAGATTGTA ATCAGGGTGA TTCCTACCTG GATGGTTGAT AAAAAGTGGT 7 80 

TAGGATTTTC TAGTACCTTC AGCAGGCGGA TGTAGCGTCT GTCTCCTTCT TCCGCCTTTT 840 

GTTCAACTCG GGCACGATTA AGAGAAACGG 870 

(2) INFORMATION FOR SEQ ID NO: 257: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1245 base pairs 
<B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 257: 

CGTTCCCAGA AGCCCGCATT CTCATCGCCA ATGTCGTGAT TGATTTGGCC CTTTCTCCAA 60 

AATCCAACTC AGCCTATGTA GCTATGGATA AGGCACTTGC TGACCTCAAA ACATCAGGGC 12 0 

ACTTGCCTAT TCCGCGACAC CTGCGTGATG GGCACTACAG TGGAAGCAAG GAACTGGGGA 180 

ATGCCCAAGA CTATCTCTAT CCACACAACT ATCCTGGAAA TTGGGTCAAG CAAGACTATC 24 0 

TGCCAGAAAA AATTCGTAAT CATCACTATT TCCAAGCAGA AGATACTGGT AAATATGAAC 300 

GGGCTTTGGC TCAAAGAAAG GAAGCTATCG ACCGTTTGCG AAAAATCTGA AATCCTTTTC 3 60 

AAAAAATTGC ACTTTCCTCT TGATTTTTTT TGAAAAAGTG GTATCATATA AATATAGAAA 420 

CGCTGTGGTG TACGACTTCA CACTTAAGTG TTGACCGACT ATTTTTTGTA TTATTAGGGA 4 80 

AACAAAAGTC TTCTAACAGC ATGTAGGCCG TCTCACACGG AAACAGCTTC AGTTAGAGCG 54 0 

AGTTGCCCAC CTGCTTAATT GCGCGGGTTC AATACAAACC GTGAAGTTTC GGCACCAATA 60 0 

CAGCTTTTTT CTTTGCCTCC TTAGCTCAGC TGGCAGAGCA GCGGACTCTT AATCCGTGGG 660 

TCACAGGTTC GATCCCTGTA GGGGGCATAT AAATACAACA GGAAAAGCCT TATAATATAG 72 0 

GGCTTTTTTT GCTTTCCTTT TAAAAATTGT CGTGCAATTT GCCGTGTTTT TACAACAAAC 7 80 

TTTTCACAGC CATAAACTCC TCACTAATTT TTTCCTCCAA GGTATGCCCA TAAACGTCAA 84 0 

TCAACATGGA GATATCTTTA TGTCCTAAAA TTTGGCTCTT TGTCAACTGT AGTGGGTTGA 900 

AGTCAGCTAA GCTCGAGAAA GGACAAATTT TGTCCTTTCT TTTTTGATAT TCAGAGCGAT 9 60 

AAAAATCCGT TTTTTGAAGT TTTCAAAGTT CCGAAAACCA AAGGCATTGC GCTTGATAAG 102 0 

TTTGATGAGA TTATTGGTCG CTTCCAATTT GGCGTTAGAA TAGTGTAGTT GAAGGGCGTT 1080 

GACGATTTTC TCTTTGTCCT TTAGAAAGGT TTTAAAGACA GTCTGAAAAA GAGGAGGAAC 114 0 

CTGCTTTAGA TTGTCCTCAA TGAGTCCGAA AAATTTCTCC GGTGCCTTAT TCTGAAAGTG 12 00 



WO 98/18931 



PCT/US97/19588 



1284 

AAACAGCAAG AGTTGATAGA GCTGATAGTG ATGTTTCAAG TCTTG 1245 
(2) INFORMATION FOR SEQ ID NO: 258: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH : 1684 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 258: 

ATGCCTATGT AACTCCACAT ATGACCCATA GCCACTGGAT TAAAAAAGAT AGTTTGTCTG 60 

AAGCTGAGAG AGCGGCAcCC AGGCTTATGC TAAAGAGAAA GGTTTGACCC CTCCTTCGAC 12 0 

AGACCATCAG GATTCAGGAA ATACTGAGGC AAAAGGAGCA GAAGCTATCT ACAACCGCGT 18 0 

GAAAGCAGCT AAGAAGGTGC CACTTGATCG TATGCCTTAC AATCTTCAAT ATACTGTAGA 240 

AGTCAAAAAC GGTAGTTTAA TCATACCTCA TTATGACCAT TACCATAACA TCAAATTTGA 300 

GTGGTTTGAC GAAGGCCTTT ATGAGGCACC TAAGGGGTAT ACTCTTGAGG ATCTTTTGGC 3 60 

GACTGTCAAG TACTATGTCG AACATCCAAA CGAACGTCCG CATTCAGATA ATGGTTTTGG 42 0 

TAACGCTAGC GACCATGTTC AAAGAAACAA AAATGGTCAA GCTGATACCA ATCAAACGGA 480 

AAAACCAAGC GAGGAGAAAC CTCAGACAGA AAAACCTGAG GAAGAAACCC CTCGAGAAGA 540 

GAAACCGCAA AGCGAGAAAC CAGAGTCTCC AAAACCAACA GAGGAACCAG AAGAATCACC 600 

AGAGGAATCA GAAGAACCTC AGGTCGAGAC TGAAAAGGTT GAAGAAAAAC TGAGAGAGGC 660 

TGAAGATTTA CTTGGAAAAA TCCAGGATCC AATTATCAAG TCCAATGCCA AAGAGACTCT 720 

CACAGGATTA AAAAATAATT TACTATTTGG CACCCAGGAC AACAATACTA TTATGGCAGA 7 80 

AGCTGAAAAA CTATTGGCTT TATTAAAGGA GAGTAAGTAA AGGTAGCAGC ATTTTCTAAC 840 

TCCTAAAAAC AGGATAGGAG AACGGGAAAA CGAAAAATGA GAGCAGAATG TGAGTTCTAG 900 

TTCTCATTTT TTTCATGAAA ATGTGCAAAA TATAGTAGAT TGAAACTAGA AT AGTAT AC C 9 60 

TCTACTTCTA AAACATTGTT AGAAATCGAT TTGACTGTCC TGTTCTTATT TCATTTTACT .1020 

ATATCTTAAC AGATAGTGTA AATAAAGATA AACTATTTAC TGGCTAATTA ATCAGTTAAA 1080 

CACTAGTTAA GGAGTAATGA TGAAAAAAAG AACAATACTA TTATTGATGG CCAGTCTGTT 1140 

AGCTCTTGTC TTAGGAGCAT GTGGTTTCTT GGACATATTG ATCCTGGATC ATTCTCATCA 1200 

GGATTACTCT TTACTGCTAT TTTAGAAACT GGGGTGGTTT GATGGAAAGT ATTGGTCTTG 12 60 

TTATCGTTTC ACATTCCAAA CACATTGCAG AAGGTGTTGT TGAACTGATT AGTAAAGTAG 1320 

CTAAAGATGT TCCGATTACT TATGTAAGAG GAACCGAGGG CGGAGGAATT GGAACGAGTT 1380 
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TTGAACAAGT AGATAGGGTT GTTTCCGAAA ATCCAGCAGA TACTTTACTT GCCTTTTTTG 144 0 

ACCTAGGTTC TGCTAAAATG AACTTAAAAA TGGTGACTGA TTTCAGTGAT AAAAGTATCA 1500 

TCATCAACAG GGTTCCAATT GTAGAAGGTG CCTATAATGC AGCTGCTCTT CTTCAGGCTG 1560 

GTGCAGAACT GTCAGTTATT CAAACACAGT TaGCGGAgCt TGAAATCAAT AAATAAGGAA 162 0 

TTTTACTATA ACTCTTTTTA TAGATAAGCT ATTGaTTATC TCAACTATAA TAATGTTAAG 1680 

TnAA 1684 
(2) INFORMATION FOR SEQ ID NO: 259: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 970 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 259: 

AGGAGTGGAG AnATATGAAG ACACAAATTT TCACATTATT GAAAATCGTT GCTGAGATTA 60 

TTATTATTTT GCCATTTCTA ACTAATCTAT AAGTTCTTTA TATTGCTGAA AACGCAATTC 12 0 

AAAAAGGGCT ATTAATTGTG GATTTTCTAA TACCTGCAGA GATTGGATAA AGCGTTCAAT 180 

CTCTTTTTGA TTGCTTCCCT TTGTTTGAAG AAAGACACTC ATCTTCTTTA AAAATTGCCA 24 0 

CGATACTTTT TCAAAAACAT CATACGGTCG TAACATCCTC TCCAACTCGG CTTCGAAGAT 300 

TGGGATGTAG GAGAAAAGTT TTCGCTCCAT GAGTTCTGAT AAGATATTTA AGAGTCCTTG 360 

CTTCATATAC AATCGATTGT GTACTAACTC TTTAAATTCT TTGGATTTTT CGAGTAAGGA 420 

GGTTGATAAA AAAAT C AG AT CTTGATTGCT CAAGAAGGGC ATGGTATTGC AAAAGAGATA 480 

GAGTTCAAAC CAGGTCCAAG ACTCGATAGC ATAGAGATAG GTGGTCAAAA ACTCGCTATC 540 

CTCCTCTGCT AGTGGGTAGC TTTTATTTAG TGAATGGATG GCATCTTTAA TCACGATGGC 600 

ATTCAAACGA CGATAGGTCT GCGCCATCTG TTCTTGATCG ACTTCCTCCA ATAGCTGCTC 660 

TAAAGCAGCT ATATCCTGAT GGGCAAAGCG ATTCACAACC TTTCGACCGA TTCGCATATG 720 

TGGAGATTCT TGATAGTTGT TGAGCTTGTG CCCAAACTCA TCAAAGGTCA CATTTATACC 780 

TTGGATAGCT AGAATCAACT TATCCGCAGA CAGCATAGAC TGCCCTAGTT CAAACTTGGA 840 

CAACTGAGAA GCTGTTAGAC CCTCACAAGC CACATCTGAC TGCTTGAGCT TTCTCGCCAA 900 

ACGTAATTCC TTGTAAAATT CCCCCAGTTC CATTCTCTCA ATCATCTGAC CACCTCCTAG 960 

CTTTTGCAGG 970 
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(2) INFORMATION FOR SEQ ID NO: 260: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2996 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2 60: 

GTTGACCACG GGTAAAACTA CCCTAACTGC AGCTATCACA ACTGTTTTGG CACGTCGCTT 60 

GCCTTCATCA GTTAACCAAC CTAAAGACTA TGCGTCTATC GATGCTGCTC CAGAAGAACG 12 0 

CGAACGCGGT ATCACTATCA ACACTGCGCA CGTTGAGTAC GAAACTGAAA AACGTCACTA 180 

CGCTCACATC GACGCTCCAG GACACGCGGA CTACGTTAAA AACATGATCA CTGGTGCTGC 24 0 

TCAAATGGAC GGAGCTATCC TTGTAGTAGC TTCAACTGAC GGACCAATGC CACAAACTCG 300 

TGAGCACATC CTTCTTTCAC GTCAGGTTGG TGTTAAACAC CTTATCGTCT TCATGAACAA 3 60 

AGTTGACTTG GTTGACGACG AAGAATTGCT TGAATTGGTT GAAATGGAAA TCCGTGACCT 420 

ATTGTCAGAA TACGACTTCC CAGGTGACGA TCTTCCAGTT ATCCAAGGTT CAGCACTTAA 480 

AGCTCTTGAA GGTGACTCTA AATACGAAGA CATCGTTATG GAATTGATGA ACACAGTTGA 540 

TGAGTATATC CCAGAACCAG AACGTGACAC TGACAAACCA TTGCTTCTTC CAGTCGAGGA 600 

CGTATTCTCA ATCACTGGAC GTGGTACAGT TGCTTCAGGA CGTATCGACC GTGGTATCGT 660 

TAAAGTCAAC GACGAAATCG AAATCGTTGG TATCAAAGAA GAAACTCAAA AAGCAGTTGT 720 

TACTGGTGTT GAAATGTTCC GTAAACAACT TGACGAAGGT CTTGCTGGAG ATAACGTAGG 7 80 

TGTCCTTCTT CGTGGTGTTC AACGTGATGA AATCGAACGT GGACAAGTTA TCGCTAAACC 840 

AGGTTCAATC AACCCACACA CTAAATTCAA AGGTGAAGTC TACATCCTTA CTAAAGAAGA 900 

AGGTGGACGT CACACTCCAT TCTTCAACAA CTACCGTCCA CAATTCTACT TCCGTACTAC 960 

TGACGTTACA GGTTCAATCG AACTTCCAGC AGGTACTGAA ATGGTAATGC CTGGTGATAA 1020 

CGTGACAATC GACGTTGAGT TGATTCACCC AATCGCCGTA GAACAAGGTA CTACATTCTC 1080 

TATCCGTGAG GGTGGACGTA CTGTTGGTTC AC-GTATGGTT ACAGAAATCG AAGCTTAATT 1140 

CGATTTAGTT CCCAGAAGAA CAATTATTTA AGTTAGACAC TAAAAGAATC TTGCTTGGCA 1200 

AGGTTCTTTT TTTAGATATT GAACTAATAC TCAATGAAAA TCAAAGAGCA AACTATAATA 1260 

TATTGAAACT AGAATAGTAC ACATCTACTT CTAAAACATT GTTAGAAATC GATTTGACTG 13 20 

TCCTGATCGA TTTGTCTTGT TCTTATTTCA TTTTACTATA GAAAGTTAGC TACAGACTGC 13 80 

TCAAAACATT GTTTTTAGGT TGTAGATAGA ACTGACGAAG TCAGtAACAT CTATACGACA 1440 
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AAAGCATTAT ACAATAGTAA TATGAAATCA ATTAAAGAAG 
CTTTTAAAAG ACTCTCGTAC AGCTAAATAT CATAAACGCC 
CTGATGGGCA AATCTTATAA AGAGATTATA GAACTTTTAT 
GAACAACTCT ATCAGGAAAG TCAAACTAAT TTATAGAAAT 
CTGTTATAGA TTCAATACAC TTTAGACTGT AATCAAACAA 
AATATGAGGA GTTCGGACTC GACTCTCTCC TTCAAGAAAC 
CTTATATGAC GGTTGAGCAA GAGAAAGTCT TTCTTGCCCG 
CAGGAGAATT TGTTACAATT GATGCCTTAT TTCAGGCTTA 
CCTACACACG TGATGCCTTC TATCAACTGT TGAAGCGCCA 
CACGTCCAGA ACATCCTAAG AAAGCAGATG CTCAAACCAT 
TCTCAATTCA AGAAGACAAG TGAACTGCAC CCCAAAAGTT 
TTGGGGTGTT TTTATTATGA AATTAACTTA TGATGATAAA 
AAAACAAGGA TATAGCTTAG AGAAGCTTTC AAATAAATTT 
TAGGTACATG ATTAAATTGA TTGATCGTTA CGGAATAGAG 
TCGTTACTAT TCTCCTGATT TAAAACAAGA AATGATTCAT 
GACTAAAGAT AGAGTTTCTC TTGAATACTG TCTCCCAAGT 
GCTAGCACAA TACAGGAAAA ACGGGTATAC TATTGTTGAG 
TGAGAGCGGA GAATGCCATC CTAAAAAAGT TAAGAGAACT 
AGAAAGAAGA AAGACAGAAA TTATTCAAGA ATTAATGACT 
TCTAAAAGCC ATTAAACTAG CTCGTTTGAC CTACTACTAT 
ACCAGATAAG GACCAAGAGC TTAAAGCTGA AATTCAATCC 
AAATTATGCT TATCGTCGGA TTTATTTAGA ACTAAGAAAT 
TAAAAGAGTT CAAGGCTTGA TAAAAGTACT CAATTTACAA 
AAAATATTCT TCTCATAAAG GAGACGTTGG CAAGAAGGCA 
ATTTGAAGGC TCTAAAACAA TGGAAAAGTG CTACACAGAT 
(2) INFORMATION FOR SEQ ID NO: 2 61: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 837 base pairs 

(B) TYPE : nucleic acid 

(C) STRANDEDNESS : double 



TAATACTAGA CTAAAATCAA 1500 

AAATCCAAAC CATCAAAACA 156 0 

TTCAAATCGT TCTATTTCGT 162 0 

AGTGGTTTGA AATAAGATGT 1680 

ATTTTAGCAG CCAAGGTGTA 174 0 

CGATTTGGCG AAATGTAAAA 1800 

ACGTGGTGGT CGTAACCATG 1860 

CCATTTGAAG GCTACAGAGG 192 0 

TAAAAAGGAG TTAGGTCGTT 1980 

TGGTTGGCGA AATATTACGC 204 0 

TGTCGCGTCT AAAAATAAAG 2100 

AGACAGAAAA AATCTAACTT 2160 

GTTCAGATCT ATGAACTTAG 2 22 0 

GGGATAAACA ATTCTAATCT 2 2 80 

TTCGTCAAAA AAGGAAAAAA 2340 

AAAGTCTGAC ATGAAGGCTG 2 400 

CGTACGATAC TTCTTAACTG 2460 

AAAACAAGAG GGAGAGTACC 2 520 

CCGATTGAAG GAGGAAAAAG 2 580 

GAGTTTTCGT TAGATATTCT 2 64 0 

CACTTGAAAC AGCTAGATAA 2 700 

ATTTTTATCG AACACAAGGG 2 760 

CGTGGTTATC TGGTAAATCA 2 820 

GCTAAAATGC GACAGAAACG 2 880 

GAGAATCTCA TTCAAGGACA 2 940 

GTGACAGAAT TTGCCG 2 996 
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(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 261: 
CTTATCAACT CCCGACATGG CTCTCAGACC AATCCAAATC CCTAAAAAAA TCAGAACAAG 
GATGGTGGTC AAGATCAAAC TCTCGAAATA TAAAGAAAAT AGTTGCAGTA GCATGATTTC 
TCTCATTTCT ATCTTTTTTA AAGAGTAAAC TCAGCTAGTC CAACTAACTG AGTTTTCCTT 
TATCTATTAT ATCAAATATA AGTCCGTTTG TAACTAGCGA AGAATTCTTT TGTCCGCTCT 
TCTTTAGGGG TGTGGATAAT CTCATCCGGA GTTCCAGACT CGATGATTTT CCCCTTATCT 
AAGAAGAGAA TTTTATCCGC AACTTGGGCT AL'AAAGGACA TGTCATGACT GACCAAAATC 
ATGGTCTGAC CTGACTTAGC AGCATCTGCA ATAGACTTTT CTACTTCACC GACCAATTCT 
GGGTCAAGGG CTGAAGTTGG TTCGTCTAAG AGCAAAACAT CTGGTTTCAT AGCAAGCGCA 
CGCGCTAGGG CAACCCGTTG CTTCTGTCCA CCTGATAAAT GGCGAGGATA ATGGTTTTCA 
CGGTCCGAAA GCCCAACCTT AGCCAACTCT TCCTTGGCAA TCTTAGTCGC TTCTTGGTCA 
GATAATTTCT TGACAACAAC CAAGCCTTCT TTCACATTAT CAAGTGCTGT TCGGCGTTCA 
AACAAATTAA ACTGTTGGAA AACCATAGAC AACTTACGAC GTAGGGCAAG GATTTCTTCT 
TGAGTGATTT TAGAAAAATC AACTGAAAAA CCATCAATCT GAATAGAGCC ACTGTCAGGT 
GTTTCTAGAT AATTGAGACT GCGAGAAAGG TTGATTTTCA GCTCTGAAGA CCAATCA 
(2) INFORMATION FOR SEQ ID NO: 262: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 868 base pairs 

(B) TYPE : nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 262: 
CCGAACAAAA TGGGCTAATT AGATTATAGT AAGAAAGGTA AGTTAAAAAT GAGAATTGCA 
ATTGGATGTG ACCACATCGT AACTGATGAA AAAATGGCGG TTTCAGAATT TTTGAAATCA 
AAAGGATATG AAGTCATTGA CTTTGGTACC TATGACCATA CACGGACTCA CTACCCAATC 
TTTGGTAAAA AAGTAGGGGA AGCTGTAACT AGCGGTCAAG CTGATCTTGG AGTATGTATC 
TGTGGTACTG GTGTTGGTAT CAACAACGCT GTAAATAAAG TTCCAGGTGT TCGTTCTGCC 
TTGGTTCGTG ATATGACAAC AGCCCTTTAT GCTAAAGAAC AATTGAACGC TAACGTTATT 
GGTTTTGGTG GTAAAATTAC TGGTGAATTG CTTATGTGTG ATATCATCGA AGCTTTCATC 
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CATGCTGAAT ACAAACCAAC TGAAGAAAAC AAAAAATTGA TTGCGAAAAT TGAACATGTT 480 

GAAAGTCACA ATGCTCAACA AACAGACGCA AACTTCTTTA CAGAATTCCT TGAGAAATGG 540 

GATCGTGGAG AATACCACGA CTAAGAGGTG ACCTATGATT TTAACAGTCA CAATGAACCC 600 

ATCCATCGAT ATTTCCTATC CCTTGGATGA GTTGAAGATT GATACTGTCA ATCGTGTGGT 660 

GGATGTAACC AAAACGGCTG GTGGTAAGGG ACTCAATGTT ACCCGAGTAC TTTCAGAATT 720 

TGGCGATTCT GTTCTTGCTA CTGGTTTAGT GGGTGGCAAA CTTGGTGAGT TTTTGGTTGA 780 

ACATATCGAT AATCAAGTAA AGAAAGATTT CTTCTCAATT AAGGGAGAAA CTCGTAACTG 840 

TATCGCTATT CTCCACGGAG ACAACCAA 868 
(2) INFORMATION FOR SEQ ID NO: 263: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3744 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY : linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 263: 

3 TCTTCATAAG ACTCGAAAGT CACAGTTCTT TCGTTCTTGC TGGCATCTAT 
ATAGGTAATT TCAATCATGT TTAAAACTCC TTTGTTTAAT GCTAACTTTA TTTTACTCCT 
TATAAAAGAG AATGTCAAGA AAAATGATTG CGCACGCAAC TTTTTTTAAA ATCATCTTAA 
ATCAAGAAAT CCAAACCTGC TTCCAAGCTT TCTTCGACAG TCTTTTGTAG CGAGGCCAGT 
GTCTTTTGCC CATCATTTGT CAGGCAGATA AAACTAGAGC GTCTATCTTG ATGGCAACAC 
ATGCGACTGA GTAGACCGCA ATTTTTAGCT TCCAAGCGAG CCACCATCCT AGAAACTGCG 
CTCGGGCTCA GATGAAGCTT ATCTGGCAGG TCAATCTGGC GTAGAGATTT TTCTTCAGCC 
AAGTCCAGAT AGTAGAGCAG GTAGAACTCT TTCAAGGTCA GACTTTGCTC GCTCTGTTGG 
GCAATGGTCT CTTCCAAGAG ACTTTCAATT TCTTTCTGAC GCCGATTGAA GTCAAACCAT 
TTTTCCAAAT AGGTCATAGT GTCTCCTTTC TTTTTAGAGT CATAAATAGA AGAAAGTCCA 
TTAACGGGCA GTCTCTGCGT CACAAGATGA TTGCGCATGC AATAATTATA CTACTTTTCA 
AGAATGCTGG CAAGCTCTGT TTTTTAGTGG TTTTATTTTT GTGTGAATAA TGGGGGAATC 
CTATTGTTTC AATTTCTAAC TCCTTATCAC ATTCGAATTC AGATTTTATT TCATTTCTCT 
ATCTATAGTT GCTTAGTTTA AAATAAGCAT GGTCTAATAA AGCTATGCAT ATAGTACTGA 
TTTTAAACAA GGAGCATTAG ATTCCATTAA AGGAGGGCAC AGACATGTCG AGGCGGCCAA 
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AGTTTTTGAT GTCGGCGTCA GAACTCTCTT CACGTGGGAA AAGAAAGACG TAAACAAGGG 960 

AACTTAGAGC GGAAAAAGCG AGTCGTCAAA AAGCGTAAGA TCCCTTTAGA AGAATTGAAA 102 0 

GCCTTTGTAG AGGCTCATCC AGACGCTTTT TTACGGGAAA TTGCGGCCCG TTTTGATTGT 1080 

GCTTTGCCCT CCGTATGGGC AGTTTTAAAG CAGATTAAGG TCATTTTAAA AAAGACGACC 114 0 

AGTTTTAGGG AACAAAAGCC TGAGAAAGTT TCTGAGTTTC TTGATATTTT GGATAACCTA 12 00 

AAAGATTTAC CAGTCCTATA TATTGACGAA ACGGGAATCG ACCGCTACCT CTATCGTCCT 12 60 

TATGCAGGGG CTCCTAGAGG GGAGAAAGTC TATGGCAAGA TTAGCGGACG GCGTTTTGAG 132 0 

CGGACTAATG AGGTGGAGCA AAAACTCAAT GGTAGTTTTC TAATCAGATA TATTGATTCA 1380 

CAAATTAGAG AATGAAAGAA TAATTATGCA TAAAAATAGG AATATAAACC AAAAATTAGC 144 0 

TGATTTATAC TCATTTGCGT GTCTTTATAA AAAACTTATC TTATAATATA TATATATATA 150 0 

TATACAAAAT AGTAAAATGC TTTTTTTTTT TAGCAAAAAT ACCTCAAGTT TCTTGCTATT 1560 

TTGGGTTCCC TATTCTATAA TTATAGTATG GTAATTTATT TATATCCATA CATGAAAATA 162 0 

ATACTCGAAA GGAAATTTCA AAATATTTTT TAGACGTCAG AAGGGTGAAT ATAGAGAAAC 1680 

AGACCGAGTA ACTCGGTTCA AATTAATCAA ATCAGGGAAG CATTGGCTAC GGGCCTCGAC 174 0 

TTCTCTTTTT GGCTTGTTTA AGGTCTTGCG AGGTGGTGTT GATACTACTC AGGTCATGAC 1800 

CGAAACGGTA GAAGATAAAG TAAGTCATTC AATTACTGGG CTTGATATCC TCAAGGGGAT 1860 

AGTTGCTGCG GGAGCTGTCA TAAGTGGAAC CGTTGCAACT CAAACGAAGG TATTTACAAA 192 0 

TGAGTCAGCA GTACTTGAAA AAACTGTAGA GAAAACGGAT CCTTTGGCAA CAAATGATAC 1980 

AGTAGTTCTA GGTACGATAT CTACAAGTAA TTCAGCGAGT TCAACTAGTT TGTCAGCTTC 2 040 

AGAGTCGGCA AGTACATCTG CATCTGAGTC AGCCTCAACC AGCGCTTCGA CCTCAGCAAG 2100 

TACAAGTGCA TCAGAATCAG CAAGTACATC GGCTTCGACA AGTATTTCTG CATCATCTAC 2160 

TGTGGTAGGT TCACAAACAG CTGCCGCTAC AGAAGCAACT GCTAAGAAGG TCGAAGAAGA 2 22 0 

TCGTAAGAAA CCAGCTAGTG ATTATGTAGC ATCAGTTACA AATGTCAATC TCCAATCTTA 2280 

TGCTAAGCGA CGCAAGCGTT CAGTGGATTC CATCGAGCAA TTGCTGGCTT CTATAAAAAA 2 340 

TGCTGCTGTT TTTTCTGGCA ATACGATTGT AAATGGCGCC CCTGCAATTA ATGCAAGTCT 2400 

AAACATTGCT AAAAGTGAGA CAAAAGTTTA TACAGGTGAA GGTGTAGATT CGGTATATCG 2460 

TGTTCCAATT TACTATAAAT TGAAAGTGAC AAATGATGGT TCAAAATTGA CCTTTACCTA 2 520 

TACGGTTACG TATGTGAATC CTAAAACAAA TGATCTTGGT AATATATCAA GTATGCGTCC 2580 

TGGATATTCT ATCTATAATT CAGGTACTTC AACACAAACA ATGTTAACCC TTGGCAGTGA 2 640 

TCTTGGTAAA CCTTCAGGTG TAAAGAACTA CATTACTGAC AAAAATGGTA GACAGGTTCT 2 700 
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AT C C T AT AAT ACATCTACAA TGACGACGCA GGGTAGTGGG TATACTTGGG GAAATGGTGC 2 760 

CCAAATGAAT GGTTTCTTTG CTAAGAAAGG ATATGGATTA ACATCATCTT GGACTGTACC 2 820 

AATTACTGGA ACGGATACAT CCTTTACATT TACCCCTTAC GCTGCTAGAA CAGATAGAAT 2 880 

TGGAATTAAC TACTTCAATG GTGGAGGAAA GGTAGTTGAA TCTAGCACGA CCAGTCAGTC 2 940 

ACTTTCACAG TCTAAGTCAC TCTCAGTAAG TGCTAGTCAA AGCGCCTCAG CTTCAGCATC 3 000 

AACAAGTGCG TCGGCTTCAG CATCAACCAG TGCCTCGGCT TCAGCGTCAA CCAGTGCGTC 3 060 

AGCTTCAGCA AGTACCAGTG CTTCAGTCTC AGCATCAACA AGTGCTTCAG CCTCAGCATC 3120 

GACAAGTGCC TCGGCTTCAG CAAGCACATC AGCATCTGAA TCAGCGTCAA CCAGTGCTTC 3180 

GGCTTCAGCA AGTACCAGTG CTTCAGCTTC AGCATCAACC AGCGCCTCGG CCTCAGCAAG 3 240 

CACCTCAGCT TCTGAATCGG CCTCAACCAG CGCCTCGGCC TCAGCAAGCA CCTCAGCTTC 3 300 

TGAATCGGCC TCAACCAGCG CCTCAGCCTC AGCATCAACG AGTGCTTCGG CTTCAGCAAG 3 3 60 

CACAAGCGCC TCGGGTTCAG CATCAACGAG TACGTCAGCT TCAGCGTCAA CCAGTGCTTC 3 42 0 

AGCCTCAGCA TCAACAAGTG CGTCAGCTCA GCAAGTATCT CAGCGTCTGA ATCGGCATCA 3 4 80 

ACGAGTGCGT CTGAGTCAGC ATCAACGAGT ACGTCAGCCT CAGCAAGCAC CTCAGCTTCT 3 540 

GAATCGGCCT CAACCAGTGC GTCACCTCAG CATCGACAAG CGCCTCAGCT TCAGCAAGTA 3 600 

CCAGTGCTTC AGCCTCAGCG TCGACAAGTG CGTCGGCCTC AACCAGTGCA TCTGAATCGG 3 6 60 

CATCAACCAG TGCGTCAGCC TCAGCAAGTA CTAGTGCATC GGCTTCAGCA TCAACCAGTG 3720 

CCTCGGCTTC AGCGTCAAAC AGTG 3744 
(2) INFORMATION FOR SEQ ID NO: 264: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 795 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY : linear 

(xi) SEQUENCE DESCRIPTION : SEQ ID NO: 264: 
CGATAAAGAG GCCTTGAGTA ATCTCAATTT GCAGATTGAA AATGGAGAGA TTATGGGCTT 60 
GATTGGTCAT AATGGGGCTG GAAAATCGAC CACTATAAAA TCCCTAGTCA GTATCATTTC 120 
ACCCAGCAGT GGTCGTATTT TGGTAGACGG TCAGGAGTTA TCGGAAAATC GCTTGGCTAT 180 
TAAACGAAAG ATTGGCTACG TAGCAGACTC GCCTGACTTA TTTTTACGCT TAACGGCCAA 2 40 

TGAATTTTGG GAATTGATCG CCTCATCCTA TGATCTGAGT AGATCTGACT TGGAGGCTAG 3 00 
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TCTAGCTAGG CTATTGAACG TTTTTGATTT TGCTGAAAAT CGCTATCAGG TTATTGAAAC 360 

TCTTTCTCAC GGAATGCGTC AGAAAGTCTT TGTCATCGGA GCACTCTTGT CTGATCCCGA 42 0 

TATTTGGGTC TTGGATGAAC CCTTGACTGG TTTGGATCCC CAGGCTGCCT TTGATTTGAA 480 

ACAGATGATG AAGGAACATG CACAAAAAGG GAAGACAGTC TTGTTTTCAA CTCATGTCCT 540 

AGAGGTGGCA GAGCAAGTCT GTGATCGGAT TGCCATTTTG AAAAAGGGGC ATTTGATTTA 600 

TTGTGGTAGT GTAGAGGACT TGAGAAAAGA TTACCCAGAC CAGTCTTTGG AAAGTATCTA 660 

CCTTAGTCTT GCTGGTAGAA AAGAGGAGGT TGCGGATGCG TCTCAAGGTC ATTAAAAAAT 720 

TAGTTGATAT CAATATCCTT TATTCATCTC AAGAAGCTAA TCTGGCTAAT CTACGAAAGA 780 

AGCAGGCTAA GAATC 7 95 
(2) INFORMATION FOR SEQ ID NO: 265: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2231 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 265: 

TGGTAATGTG CTTGGCAGCw TCCTTGACAC TCCTACTACC ATTTCCCATA GCGACCGACA 50 

TACCAACGCC AGCCAGCATT TCAAGATCAT TATCTGAGTC ACCAAAAGCC ATGACTTGGT 120 

TGAGGTCAAA GCCATATTCT TTCCCAACTC GGCGAATGCC TTCTAATTTA GAATTTCCCT 180 

GATTGATGAC ATCCGATGCA AAAGGATTGC TACGTGTCAA TTTCAAGTCT TCAAAATCAG 240 

CTGCCGCCTT CTCAGATTCT TCTGGTGTCA TCAGCATCAA AACTTGGTAG ATAGGCTGAT 3 00 

TCATCAGGTG AAGCAGGTCC TCTTCCTTTT GGGGAACAAC CTTGCTGACC ATGCGATTAA 3 60 

AAGACTGACT CACCGTCCGA GTTAAAACAG AGGGAACGAA GCGACTAATT CGTTGGGAAA 42 0 

AAGAACCCAG ACCAAAGGAC ATGATTTTAG AACCCAACAT GGCATCCTTG GTCCCTAGAG 480 

CAATCTCCGT GCCCTCTTTT TTAGCATAGC TAATTAGATG GCGCAAATGT AACTTGGAAA 54 0 

TAGGGCTCGT GAACAAGACT CTGTCTTTAC TAAAGATATA CTGGCCATTA TAGGTTACCG 600 

CAAAATCCAG ATCCAAATCG TCCATCAATT CCTTAACAAA AAAAGGTCCT CGCCCTGTCG 660 

CTACGCCAAC TAGTACCCCT TGTTCTTTGA CAATCTTAAT CGCATCCTTA GTGGATTTCA 720 

AAACACTCTT GCGATTGTTG ACCAAGGTTC CATCGATATC AAAAAAAACA GCTTTGACTT 780 

CCATCCTATC CCAATCTCCC CTTTTGTGAT ACAATGATTA TACCACATTT CAGAAAGAGT 840 

GAGTAAATCA TGCCTAAGAA AATCCTTGTT TTACATACGG GTGGAACTAT TTCCATGCAG 900 
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GCCGATGCTT CTGGCGCTGT TGTGACGAGT TCAGATAATC CCATGAACCA TGTGTCCAAC 96 0 

CCACTTGAAG GAATCCAAGT CCACGCCTTG GACTTTTTTA ACCTTCCAAG TCCCCATATC 1020 

AAACCCAAAC ATATGCTGGT CCTCTACCAG AAAATTAAAG AGGAAGCAGA TAACTACGAT 1080 

GGAGTGGTGA TCACACACGG AACCGATACT TTAGAGGAAA CAGCCTATTT CCTTGATACC 1140 

ATGGAAGTTC CCCATATGCC TATCGTTCTA ACAGGAGCCA TGCGTACtCC AATGAGCTCG 1200 

GTAGTGATGG TGTTTATAAT TACCTAAGTG CTTTACGAGT GGCCAGCGAT GACAGGGCTG 12 60 

CTGACAAAGG AGTTTTGGTC GTTATGAACG ATGAAATCCA CGCTGCCAAG TATGTCACCA 1320 

AAACACATAC GACTAATGTC AGCACCTTCC AGACTCCAAC ACATGGCCCC CTTGGTCTCA 1380 

TCATGAAACA GGAAATCCTC TACTTCAAAA CAGCTGAACC TCGTGTTCGC TTTGACCTTG 1440 

ATCACATACA AGGTTTAGTC CCTATCATCT CGGCTTATGC TGGTATGACA GATGAGCTGA 1500 

TTGATATGCT GGATTTAGAA CACTTGGACG GTTTGATTAT CCAAGCCTTC GGAGCTGGTA 1560 

ATATTCCCAA AGAAACGGCT CAAAAATTAG AAAGCCTTCT GCAAAAAGGA ATTCCAGTCG 1620 

CTCTGGTATC ACGATGCTTT AACGGTATTG CCGAGCCTGT TTATGCATAC CAGGGTGGGG 1680 

GCGTACAGTT GCAAAAAGCA GGCGTTTTCT TTGTTAAAGA ACTCAACGCC CAAAAAGCTC 1740 

GCTTGAAACT CCTCATCGCC CTCAATGCCG GACTAACAGG ACAGGCTTTG AAAGACTATA 1800 

TGGAAGGCTA ATACTCTTCG AAAATCTCTG CAAACCACGT CACGTCGCCT TACCGTATGT 1860 

ATGGtACTGA CTTCGTCAGT TTCATCTACA ACCTCAAAAA CATGTTTTGA GCTGACTTCG 1920 

TCAGTTCTAT CTACAACCTC AAAAACATGT TTTGAGCTGA CTTCGTCAGT TCTATCTACA 1980 

ACCTCAAAAA CATGTTTTGA GCTGACTTCG TCAGTTCTAT CTACAACCTC AAAAACATGT 2 040 

TTTGAGCTGA CTTCGTCAGT TCTATCTACA ACCTCAAAAA CATGTTTTGA GCTGACTTCG 2100 

TCAGTTCTAT CTACAACCTC AAAAACATGT TTTGAGCTGA CTTCGTCAGk TCTATCTACA 2160 

ACCTCAAAAA CATGTTTTGA GCTGACTTCG TTAGTTTCAT CTACAACCTC AAAAACATGT 2 220 

TTTGAGCTGA C 2231 

(2) INFORMATION FOR SEQ ID NO: 266: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 1310 base pairs 
<B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 266: 
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GAGTCAAAGG CTCCGAGGTT GACTTTTTAC AAGGGGACAG GTGAATATTA TCTAGACCTG 60 

TCAGAAATTC TCTTCTTTGA AACAGAAGGG AGCAAGATCT ACGCTCATAA CCAGAAGGAA 120 

GCTTATGAGG TTCGCCTCAA GCTCTATGAG TTGGAGTCTA TCTTGCCTCG CTATTTTAAT 180 

CGAGTTTCCA AGTCAACGAT CGCAAACATC CGTCAGATTT ACTCAGTGGA CAAGTCCTTT 2 40 

TCAGGAACGG GCACCATTTC CTTTTATCAG ACGCACAAGG AGGTTCATGT CTCACGGCAT 300 

TACCAATCCC TCCTAAAAGA AAATCTAAGA AACATGAGGT AAAAAACATG AAAAAGAAAG 3 60 

CATTTGGTAT TGTTTTATTG GTTTTAGCAG CTTGGATCTT GCTGCAAGGG AATTTTGGAA 42 0 

TTCCTTCTTT GGATGGTAAA ATATGGCCTT TACTAGGTAT TGTTTTTTTT GCTTATAAGT 480 

CCATTGAGTC CATCCTTAGA CGTCATCTCA CTTCGGCAGT TTTTACAGGT TTACTGGCGC 54 0 

TCATCATTGC AAATTACGCT TATGACTTGT TACCAGTTAC CAATCATTCT CTTATTTGGG 6 00 

CTAGCATCTT GGTGGTACTT GGTGTTGGTT ATCTGACGCA TTCAAGTAAG TTCTGGAATG 6 60 

AAAAAAAATG GTGGTACAAT GGGAAAAAAA CAGTCGTCAC GGATAAGGAA GTCGCTTTTG 720 

GTAGCGGGAC CTTCTATAAG CAAGATCAAG ATCTCGTAGA TGACCAAGTG GAAGTCGCTT 780 

TTGGGGATGC TAAAATCTAC TATGATAATG CAGAGATGCT AGGTGATTTT GCAACTTTAA 840 

ATATTGAACT GGCCTTCGGG AATGCAACCG TCTATGTTCC ACAACACTGG CGTGTAGATT 900 

TGAAAGTAGA AACCTCCTTT GGTGCAGCTA AGGCTGACGC TCCTGTAGCC CCAACCAGCA 960 

AAACCTTGAT TATCCGTGGA GATGTGGCTT TTGGGAAGTT GGAAATTGTC TACGTTAAAT 102 0 

AAAAAAATCT TCACTTCAAC CATCAAAATA GACGTACTAA GAGTAGGAAA TTGATGCCTT 1080 

GCTCTGATTT CAGTTCTATG GTTGTTAGAC TTTAAAAAAT GAAATGCTGC CTTTAAAAGT 114 0 

TGTATATTTT TCGATATTTT GGCTTTTACG TTTGATGTAT CTATGTACTA CAGCGTAGAT 1200 

GATGTAGTGT CAAATGCTTT TAAAAAACGG ATGATATTGG ACAGTTTTTT TGCCTTTAAT 12 60 

TGCTCAGGAA CCATGAAAGT CAGTACCTGG GTTTATGACA AGGGAGAATG 1310 
(2) INFORMATION FOR SEQ ID NO: 267: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5922 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 267: 

ACTCTGATTT GATTGGAACG ACAGTCGGTG CCATTGCAGT TACTTCAAAC GTAACGACTT 60 

ATGTTGAGTC TGCTGCTGGT ATCGGTGCAG GTGGACGTAC TGGTTTGACA GCCTTGGTTG 120 
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TAGCTATCTG TTTTGCGATT TCAAGCTTCT TTAGCCCACT TCTAGCGATC GTACCAACAG 180 

CGGCTACAGC TCCAATCTTG ATTATCGTTG GGATTATGAT GCTTGGTAGC TTGAAAAATA 240 

TCCATTGGGA TGATATGTCT GAAGCAGTTC CTGCCTTCTT CACATCTATC TTTATGGGAT 3 00 

TCAGCTACTC TATCACTCAA GGGATTGCAG TTGGTTTCTT GACTTACACT TTGACTAAGC 3 60 

TTGTTAAAGG TCAAGTTAAA GATGTTCATG TCATGATTTG GATTTTGGAT GCCTTGTTTA 42 0 

TCCTTAACTA CATCAGCATG GCCTTATAAT AGAATGACCC AGGGGGATTT CCCCCCTTTT 4 80 

TTAATACAaG GAGATAGGTG ATGAAAGAGA AAAATATGTG GAAAGAATTG TTGAATCGTG 54 0 

CAGGCTGGAT TTTGGTCTTT TTACTTGCCG TCCTTTTATA TCAGGTTCCC CTAGTGGTTA 600 

CCTCTATTTT GACTTTAAAA GAAGTAGCCC TGCTACAGTC AGGGCTGATA GTTGCTGGCC 660 

TTTCAATTGT GGTTCTGGCT CTATTTATTA TGGGAGCTCG TAAAACCAAG TTAGCTAGTT 72 0 

TTAATTTTTC TTTTTTTAGA GCTAAAGATT TGGCACGTTT GGGCTTGAGT TATCTAGTTA 7 80 

TTGTCGGGTC AAAT AT AC T T GGTTCCATTT TATTGCAACT GTCAAATGAG ACGACAACAG 84 0 

CTAACCAGTC TCAGATTAAT GATATGGTTC AAAATAGTTC GTTGATTTCC AGTCTCTTCT 900 

TGCTAGCCTT GCTTGCTCCG ATTTGTGAGG AAATCTTGTG TCGTGGGATT GTTCCTAAAA 960 

AGATTTTCCG AGGCAAGGAG AACTTGGGAT TTGTAGTCGG TACGATTGTG TTTGCTTTAT 102 0 

TGCATCAACC AAGTAATTTA CCTTCTTTAT TGATTTATGG AGGTATGTCG ACAGTTCTAT 1080 

CTTGGACAGC CTACAAGACC CAACGTTTGG AAATGTCGAT CTTGCTTCAC ATGATTGTTA 1140 

ATGGGATTGC TTTCTGTTTG TTGGCTCTTG TGGTGATTAT GAGTCGGACA TTAGGAATTT 1200 

CTGTTTAAAA GTTTTTATGT AGGAACCGAC CTCTTTCTAC CAGGGAAAGA TGAATGCAAT 12 6 0 

CGTGTCCATC TTTTTCTTTT TATGGTAAAA TAGAAAAATA ATATGATGAA AATCCTTGAG 13 2 0 

GGAGTGACCG ATATGTCAAG TAAAGCCAAT CATGCAAAGA CAGTTATTTG CGGAATTATC 13 8 0 

AATGTAACCC CAGACTCCTT TTCGGACGGT GGTCAATTTT TTGCTCTTGA GCAGGCGCTC 14 4 0 

CAGCAGGCTC GTAAATTGAT AGCAGAAGGA GCCAGTATGC TAGATATCGG CGGAGAATCG 1500 

ACTCGGCCGG GAAGTAGCTA TGTTGAGATA GAAGAGGAAA TCCAGCGTGT TGTTCCAGTG 1560 

ATCAAAGCGA TTCGCAAGGA AAGTGATGTC CTCATCTCTA TTGATACTTG GAAGAGTCAA 162 0 

GTAGCAGAGG CTGCTTTGGC TGCTGGTGCC GATCTAGTCA ATGATATCAC TGGTCTTATG 16 8 0 

GGTGATGAGA AAATGGCTTA TGTGGTAGCT GAAGCGAGAg CGAAAGTGGT CATCATGTTT 174 0 

AACCCAGTTA TGGCTCGACC TCAGCATCCT AGTTCGCTTA TCTTCCCTCA TTTTGGTTTT 180 0 

GGTCAAACCT TTACAGAAAA AGAGTTAGCT GACTTTGAAA CATTGCCAAT CGAAGACTTG 1860 
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ATGGTGGCTT TCTTTGAACG AGCACTAGCG AGAGCGGCAG AAGCTGGTAT TGCACCAGAA 192 0 

AATATCCTGT TGGATCCAGG AATTGGCTTT GGTCTGACCA AGAAAGAAAA TCTGCTTCTT 1980 

TTACGGGACC TGGATAAACT ACATCAGAAG GGCTATCCAA TCTTTCTCGG AGTGTCGCGC 2 04 0 

AAGCgATTTG TCATCAATAT CCTAGAGGAG AATGGTTTTG AAGTCAATCC TGAGACAGAG 210 0 

CTTGGTTTCC GAAATCGGGA CACGGCTTCG GCTCATGTAA CTAGTATCGC TGCGAGACAG 2160 

GGTGTAGAAG TGGTGCGCGT GCATGACGTA GCTAGTCACA GGATGGCAGT TGAAATTGCC 2220 

TCTGCCATTC GTCTGGCTGA TGAAGCGGAA AATTTAGATT TAAAACAATA TAAATAAGAT 2 280 

GAAAGAAATT GAAAACAATC AGTGGATTGC TAACTACCGG ACGGATCAAC CGCATTTTGG 2 340 

CTTGGAACGA ATGGTGGAAC TGTTAGCTTT GCGTGGCAAT CCCCATCTCA AACTCAAGGT 2400 

CCTCCATATC GGAGGGACTA ACGGCAAGGG CTCGACTATT GCTTTTTTGA AAAAGATGCT 2 460 

AGAAAAGCTA GGGTTGAGAG TTGGCGTGTT TAGCTCGCCC TATCTCATTC ATTACACAGA 2 520 

CCAGATTAGC ATCAATGGGG AATCGATCTC AGAAGCGAGG CTAGAAGCTC TCATGGCAGA 2 580 

CTATCAGTCT TTGCTGGAGG GAGAAGCGGT CGCCAATTTA CAGGGCACAA CCGAGTTTGA 2 640 

GATTATCACA GCCCTGGCCT ATGACTACTT TGCCTCAGAG CAAGTAGATG TGGCCATCAT 2 700 

GGAAGTTGGC ATGGGTGGAC TTTTGGATAG TACCAATGTC TGTCAGCCCA TTTTGACAGG 2 760 

AATTACAACT ATTGGCTTGG ATCATGTGGC TCTACTTGGT GACACCTTGG AGGTCATAGC 2 820 

AGAGCAGAAG GCAGGTATTA TCAAACAAGG GATGCCCTTG GTAACAGGGC GTATTGCTCC 2 880 

AGAAGCCTTG GCTGTGATTG ACCGCATTGC GGAAGGGAAA GATGCGCCGA GACTTGCCTA 2 940 

CGGGACAGAT TATCAGGTTC GTCATCAAGA AAGTGTGGTG ACAGGGGAAG TCTTTGACTA 3 000 

TACAAGTGCT GTCAGACAAG GTCGCTTCCA GACTAGCCTG CTTGGTTTGT ACCAAATAGA 3060 

GAATGCTGGG ATGGCCATAG CTTTACTTGA TACTTTTTGT CAAGAAGATG GTCGAGAGCT 3120 

AGCAAGCAAT GATTTTCTTG GTCAAGCCTT GGAAGAAACA AGTTGGCCAG GGCGTTTGGA 3180 

AATCGTGTCA AGAGATCCCT TGATGATTTT GGATGGAGCC CACAATCCCC ATGCTATCAA 3 240 

GGCCTTGTTG GTAACCTTGC AAGAACGTTT TGCGGATTAT CATAAGGAAA TCCTCTTCAC 3 300 

TTGTATCAAA ACCAAGGCCT TGGAGGATAT GTTGGACTTG CTGGGAGCCA TGCCAGTTAC 3 360 

CGAGCTTACT CTAACACATT TTGCGGATAG TCGGGCGACG GATGAAAACG TGCTGAAAGA 3420 

GGCAGCTAAG TCTAGAAATC TCAGCTACCA AGATTGGCAT GATTTTCTAG AGCAGAATTT 3 480 

GACAGATAAA AAAGAAGAGA AACAAACAGT TAGGATTGTC ACAGGTTCCT TGTATTTCTT 3 540 

GAGCCAAGTG AGGGCCTATC TGATGGAGAG GAAGAACGAG AATGGATACA CAAAAGATTG 3 600 

AAGCGGCTGT AAAAATGATT ATCGAGGCTG TAGGAGAGGA CGCTAATCGC GAGGGCTTGC 3 660 
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AGGAAACACC TGCTCGTGTA GCCCGTATGT ATCAAGAGAT TTTTTCAGGT CTTGGTCAAA 3 720 

CAGCAGAGGA ACATTTGTCA AAATCCTTTG AAATTATTGA CGATAATATG GTGGTAGAAA 3780 

AGGATATCTT TTTCCATACC ATGTGTGAAC ACCACTTCTT GCCATTTTAT GGTAGAGCGC 3 840 

ACATTGCCTA CATTCCAGAT GGTCGTGTGG CAGGCTTGTC TAAGCTAGCC CGTACGGTTG 3 900 

AAGTTTATTC GAAAAAACCA CAAATTCAAG AACGTTTGAA TATCGAAGTG GCCGATGCCT 3 960 

TGATGGACTA TCTAGGTGCT AAAGGAGCCT TTGTTGTCAT TGAGGCGGAA CATATGTGTA 4 020 

TGAGTATGCG TGGTGTTAGA AAACCAGGCA CTGCAACCTT GACGACAGTA GCTCGTGGTC 4080 

TATTTGAAAC AGATAAGGAT CTCCGTGACC AAGCTTATCG TTTAATGGGG CTATAAAAAG 4140 

AATCCGCTTC AAGCGGATTT TTCTAGAAAG GAATCATTAT GGATCAACTG CAGATTAAGG 42 00 

ATTTGGAAAT GTTTGCCTAT CATGGTCTTT TTCCTAGTGA GAAAGAATTG GGGCAGAAAT 42 60 

TTGTCGTTTC AGCCATCCTA TCCTATGATA TGACCAAGGC AGCTACAGAC TTGGATTTAA 43 20 

CAGCCTCTGT CCATTACGGA GAATTGTGTC AGCAGTGGAC GACTTGGTTT CAGGAAACGA 43 80 

GTGAAGATTT GATTGAAACG GTAGCCTATA AACTGGTGGA ACGTACCTTT GAGTTTTATC 4 440 

CTCTTGTCCA AGAAATGAAG TTGGAACTGA AAAAACCTTG GGCACCGGTG CATTTGTCAC 4 5 00 

TAGATACTTG CTCGGTAACC ATTCATCGCC GCAAGCAACG AGCCTTTATC GCCCTAGGAA 4560 

GCAATATGGG AGATAAACAA GCAAACTTGA AGCAAGCCAT TGACAAACTG CGAGCTCGTG 4 620 

GCATCCATAT TCTCAAAGAG TCCAGTGTCT TAGCGACGGA GCCTTGGGGT GGAGTGGAGC 4 680 

AGGATAGCTT TGCCAATCAA GTGGTTGAGG TGGAAACCTG GCTACCAGCA CAAGACTTGT 4 740 

TAGAAACCTT GTTAGCCATT GAGTCAGAGC TGGGACGGGT GAGAGAAGTG CATTGGGGAC 4 8 00 

CTCGTTTGAT TGATTTGGAC TTGCTCTTTG TGGAGGACCA GATCCTTTAT ACAGACGACC 4 8 60 

TCATATTGCC TCATCCTTAC ATAGCGGAAC GCCTTTTTGT CCTTGAGTCt TACAGGAAAT 4 92 0 

TGCGCCTCAT TTTATCCATC CGATATTAAA ACAACCGATC CGCAACTTGT ATGATGCTTT 4 980 

GAAAAAATAG AAAAACTCTA GTTTTCAGTT ACTTGCAACT GAAGGCTAGA GTTTTTATAC 504 0 

TCTTCGAAAA TCTCTTCAAA CCACGTCAGC GTCGCCTTAC CGTACTCAAG TACAGCTTGC 5100 

GGCTAGCTTC CTAGTTTGCT CTTTGATTTT CATTGAGTAT TAAAATAGGT CATTTTCTTC 5160 

TGGGAGGAGG ATAGTTTCTC TACCGTCCAT GTCTAAAACC AGTACTCTTG GGGGATAACG 52 2 0 

AGGGTCGAAA GGATGGTTAA AGTCAAAATC AATGGCTGTA GGGAGGTGTT GACTTGAAAA 52 80 

GTGGAAGGTA ATCTTTCCTT GGTTATTAAG CAATTGAAAC TCGAGTTCTT CTTCCAATTC 534 0 

AAAGACATTT TTTAAGAAAT GGTCGATGAT ATACCAAAAA GAGTCAATGA TGTCATCAGG 54 00 



WO 98/18931 



PCTYUS97/19588 



1298 

CAAGCTGGTA ACAATACCAA AACTAGCAGA TCGCATGTGG GTATTGGTAA AAGCCATATC 54 60 

TCTGTCCCCT TTCTTTTCCC TTATCATACA GCAAATAGGA TTAAAAATCA AGAAAAGGTG 5520 

ATTTTTTGAA AAGGATTTTA GTTACAGGGA GAAATAGGGA AAAAATTCCT AAAAATCTAC 5 580 

CGAAGTTAAT AGGTAAATTC CCAAATTAAC TTGATTATAT AACTTTCAGT TACTTTGAGA 5 640 

AGTTACCGAA AAATATTTTT CATATCTATT GACTTTTAGG GGTAAAATTT GGTATGATAG 5700 

TAGGCGGTAT TGTTTACCCC ATTTGAAAGG CCCCGGAACC TTCCAAATAC TTTTCGATGG 57 60 

GAAGGAACAC CCATCACCGT AAACAAAAAT CGAACTATAT ATAGGAGAAA TCATGAACAA 5 820 

AACAACATTT ATGGCTAAAC CAGGCCAAGT TGAACGTAAA TGGTACGTAG TTGACGCAAC 5880 

TGATGTACCA CTTGGACGTC TTTCTGCAGT AGTTGCTAGC GT 5 922 
(2) INFORMATION FOR SEQ ID NO: 268: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19BB base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 268: 
TAACTATCTA CGATGAGCTG TTGTGATTCT CATTAGTTCC CCTTTCCCAA GAGGCATAGG 60 
GGTGCGCATA ATAGATGTGC TCCTCAGAAA ATATATCAAA CAAGCGATTG AATTCCGTTC 120 
CATTATCTGC CGTGATGGAA AGAATCTTGT GTTGTTTTAA GATGAGTTTT AGAGCCTGAT 180 
TGACCACCTC AGCACTTTTA TTTGGAATCA ATCGGATGAT CTGATGTCTA CTCTTTCGAT 24 0 

CCGTCAAGAC AATCAAGCAG TAGTTTTTCG ATCTCGTAAG TAGAACCGTA TCAATCTCAT 3 00 

AATGCCCATT CTCCAAGCGA AGATTGATAG CTTCAGGCCG CTGTTCGATG GATTGACCAG 3 60 

CAGGTTTAAA GTTGGTGCTA GCCTGTTTCT TAAGCGCTTT TCCTTTTCTA GGGTAAAGCA 420 
AATCCTGCTT GCTTAACCCC AATTTTCCAT GATGAATCCA ATAGTAAATG GTTGAAATTC 4 80 

CCACGTTAAC CCCTTTAGCC ATAACCATCA TTTCAGGCGA AAATTTTTGG TTATGATAGT 540 
GGAGAATCTT TTCCTTTAGT TCCTTGGTCA AGCTTGATTT CTTGACCGAG CGCTTGCGAT 600 
TGTTTTCATA AGACTGTTGA GCGTAGTCGG CAGAATAAAC CTCTTTGAAG CGCCCTTTTC 6 60 

CAAGACATTG TCGGACTGTC CCACGCTTGA TTTCAGTGTG ATAGTTTGAG GAGCTTTTCC 72 0 

AAGTAGAGAG GCAATTTCTC TATTTGATTT TCCTTCTTTT TTCCATCTTT CGATTAAGCG 7 80 

ACGGCTATCG ATTGTCAAAT GTTTGGCTTT TGTAGTATAA TTGTCTTGCA TCTCTGTGCC 84 0 

TTTCTTGTGT TTGTGGTTGA ACAACAAGTA TAACACAGAG GTGCTTTCTT ATGCCTACAA 900 
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GAGCTTTCAT TATTTCCATT TTCTTTTGGA TTTCACTCTA TTCTGAAAAA CTTGTGTATA 960 

TTTACTGAAG CTAGCAAGTC TTACCTGTAA ATTTAATGAA AGCAACACAA AATCCGAGAG 102 0 

GGGAATCTCG GATTAATAGA TAGAGAGTTT TTAGTTTAAA TAAATTGTTT AAAATATCAA 108 0 

CAACATCACT TCTTTTCTTA ACCTGATAAG TCTTGATTCC TAATTTTGGG GCTACGATTA 1140 

TATTGTCCTC AATATCGTCT AGAAAGACAC AATTTCTAGG TTATAACTGG TATTTATCGA 12 00 

TAGTTACTCA TATACATCAG TCCACCTCCA TACTTATGTG CGAGCCTCTC TTTGTATTAT 12 6 0 

ACCTCCATAC TCACCTTACA GATTCTTTTG GTAATAATAT CTTTGCCTAA TGTAGAGACA 1320 

GTCTTGCAAA GAAAAAACTT CCTTGTAGCC ATGTTTCTGA TAAAAGTCCG GTGCCTGGAA 13 80 

CTGGTAAGTA TTGACAAAGG CAAAACAACA ATTTCGATTC TTAGCTTCAC TTTCTGCCTG 1440 

TTGCAATAGT TTTGAACCGA TTCCTTGCCC TCGCAGTTCC TCTTTTACAA ACAAATACTC 1500 

GATTTCTAGC CAATTTCCAA AAGTCTCTGC TATCAAACCT GCCAGGAGAT TGCCCTTTTC 15 60 

ATCTTCGACA TAAAGATTAA GTGGCTCACT TTCAGCCTCT TCTCTTTTTG AACGGTTATA 162 0 

AACACGAATC AGATTCCCTA TTTCTTGCGA TTTATGTGAT TCCTTATTTT CCAATCTAAA 168 0 

GTATAGTGAA ATGAAATAAA ACATGCGCAA ATCGATTAAG GAATTTAATC TAATTTCTAA 174 0 

CAATGTCTTA GAAATCAAAG TGTACTATTT TAACTTCAAT GCACTATACA TCTAATACTC 1800 

AATAAAAATC AAAGAGCAAA CTAGGAAACT AGCCGCAGGT TGCTCAAAAC ACTGTTTTGA 1860 

GGTTGTAGAT AGAAcTGACG AAGTCAGCTC AAAACATAGT TTTGAGGTTG TAGATGAAAC 1920 

TGACGAAGTC GGCTCAAAAC ATGGTTTTGA GGTTGTAGAT GAAACTGACG AAGTCAGCTC 198 0 

AAAACAGG 198 8 
(2) INFORMATION FOR SEQ ID NO: 269: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 709 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO; 269: 

CCGGATATTT GTTTTATGTA ATTTTCTTGC AAGTTTCTTC TTAGTAGCTT GTCAGTCAGG 60 

TTCTAATGGT TCTCAGTCTG CTGTGGATGC TATCAAACAA AAAGGGAAAT TAGTTGTGGC 120 

AACCAGTCCT GACTATGCAC CCTTTGAATT TCAATCATTG GTTGATGGAA AGAACCAGGT 180 

AGTCGGTGCA GACATCGACA TGGCTCAGGC TATCGCTGAT GAACTTGGGG TTAAGTTGGA 240 
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AATCTCAAGC ATGAGTTTTG ACAATGTTTT GACCAGTCTT CAAACTGGTA AGGCTGACCT 3 00 

AGCAGTTGCA GGAATTAGTG CTACTGACGA GAGAAAAGAA GTCTTTGATT TTTCAATCCC 3 60 

ATACTATGAA AACAAGATTA GTTTCTTGGT TCGTAAGGCT GATGTGGAAA AATACAAGGA 42 0 

TTTAACTAGC CTAGAAAGTG CTAATATTGC AGCCCAAAAA GGGACTGTTC CAGAATCAAT 4 80 

GGTCAAGGAA CAATTGCCAA AAGTTCAATT AACTTCCCTA ACTAATATGG GTGAAGCAGT 54 0 

CAATGAATTG CAGGCTGGAA AAATAGATGC TGTTCATATG GATGAGCCTG TTGCACTTAG 600 

TTATGCTGCT AAAAACGCTG GCTTAGCTGT CGCAACTGTC AGCTTGAAGA TGAAGGACGG 660 

CGACGCCAAT GCCGyTGCTC TTAGAAaATA GTGATGATTT GAAAGAAGT 709 
(2) INFORMATION FOR SEQ ID NO: 270: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1680 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 270: 

TATAAAATGT TAAGTTAAAT GATTTCAAAA TTCAGAAAGG GATTGCTTTA TGCAGTTCCT 60 

TTTTATTTTA ACAGGAGTGA AACTATAGTG TTTCTAAATT GTGAATCAAT CAAAACTGAT 120 

TGTGATGGGG CTATTCTAGC TTTAGAAACC TTCAAAAATT AAAATTTAAG GCAATCAATT 180 

ACTTGGAAGA GTATGAAAGC ATTTAGTTTA TAGGAATTCT AGGTCTAGAA TTACATATAT 240 

ATATTTATGA AGACGGGGTG TTCGATAGTT AGTATTGTTC TATTCTGAAA GATTTGAGCT 300 

GTCAGTTGTA TAGAAAGTGT TCGAATTTTT TTAAGTGATT AAATTAGTTA ATTGTATGAG 360 

GTGCTTTATG ATATAATGTT CTTAATGAAT TTTCAGAAAG GAAAACCTCA AATTGTTCTA 420 

CAAATTTCTA CTCTTCGACC TCGACCACAC TCTTCTTGAT TTTGATGCTG CTGAGGATGT 480 

GGCTTTGACC CAACTTCTAA AAGAAGAAGG AGTTGCGGAT ATTCAGGCTT ATAAAGAT'iA 540 

TTACGTTCCT ATGAACAAGG CTCTCTGGAA AGACTTGGAG CTGAAGAAAA TCAGTAAACA 600 

AGAGCTGGTT AACACGCGCT TTTCTCGTTT ATTTGCTCAT TTTGGACAGG AAAAAGACGG 660 

TAGTTTTCTT GCCCAGCGTT ACCAATTTTA CCTCGCCCAG CAGGGACAAA CACTATCGGG 720 

CGCTCATGAT CTCTTGGACA GCCTCATTGA GCGTGATTAT AACTTGTATG CTGCGACAAA 780 

TGGCATTACT GCCATTCAGA CAGGACGTTT GGCTCAATCT GGTCTAGCAC CTTATTTCAA 840 

TCAAGTCTTT ATCTCAGAAC AGTTGCAAAC TCAAAAGCCG GATGCTCTTT TTTATGAAAA 900 

GATTGGCCAG CAAATTGCTG GATTTAGTAA AGAAAAGACG CTGATGATTG GAGATTCTCT 960 
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AACCGCCGAC ATTCAAGGTG GCAATAATGC GGGGATTGAC ACTATCTGGT ATAATCCTCA 1020 

TCACCTCGAA AATCACACAC AAGCCCAGCC GACTTACGAA GTCTATTCTT ACCAAGACTT 1080 

GCTGGATTGT TTAGATAAAA ATATTCTTGA AAAGATCACA TTTTAAAGGA GACGAGCTAA 1140 

TGACTACAAA AAAGCTAATA TTACTATTGA AGAGTACATT GAAATGTCTG AAGTTGATTT 1200 

TAATGAAGCT GTTAATTATG AATTTACATC TGACACTTGT CAATTAGCAA ATAGTATTTA 12 60 

TCAATCTCTT TTTAAGTTTT TTGATAAGAA AAATTTCTCT GGCGATTTAA TTTTTACTTG 1320 

GAAATCTCCA TCATTAGTCA AAGAAGGGGA TTATATTGGG AGAAGGGATT CACAAGTAGA 13 80 

TAATCTTAGA GTAATAGGAA ATATATTTCC GAATTATCTT ACTAATCGAA AATATAGCCT 1440 

CAATATGAAT CGTAATGGCT GTATGGGAGA TTTTCCTCAT GACTTTTTTG ATATATACCT 1500 

AGATCATGTA GCAAAATATG CCTACGAACA AAAAGTTAAT AATATTAAAG AGTATTATCC 15 60 

TTTAAAAAGA GCGATTTTAC ACCAAGAGAA TGCATTGTAT TTTCGATTTT TTTCTAATTT 162 0 

TGACGACTTT TTAGAAAAAA ATTATTTAAA GACTATATGG CAAGTTTCTA AAGAAACTCC 1680 
(2) INFORMATION FOR SEQ ID NO : 271: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 598 base pairs 

(B) TYPE : nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 271: 

AGCTCGGTAC GTAGTATnTG TGGTGCATAA ATGAGTGAAA AGAGGATAGA GAGGATGAGG 60 

CCGATAAGAA CACCGGTAGC TGCATCGTGA AATACTTGTT TTTTCATAGT TCTAATTTCT 12 0 

CCTTGATGGT TTTTAGATAA CGGCGTGAAG AGTAGGTGAA GCTTTCGTTT TTCAAGAAAA 180 

TTTCTACCAG ACCGTTTGGC GTGAgCTTGA GGTGAGAGAT GGAATCGATA TTGATGATTT 240 

CTGATTGGGA AATTTGGATA AAATTGGTTG GCAAGAGTTT AAGAACCTGA TAGAGTCGCA 3 00 

AATCAATGCT GTAGGTCTGA CTCGCGGTTT CTGCTAGAAC CTTCCGATTC TCGATATAGA 3 60 

AGCGCTGAAT CTTGCCAATC TCAACTAGAT AGACCTGATC ATCGATTTTT CCTTTGATTT 42 0 

TTTCTCTTTG GTCCAGATTT TCTGCGAACT CGATGACTTT CTGGACTTTT TCGGTTTCTT 4 80 

GAGGTGCTTG GACAATCAGC TTTTCCTCCT CGTAAGTCTC ACTAATCTGT AGTTCTACTT 540 

TCATAGTTTT CTCTCCTTTT CAGTTATACA AGGTTGTGAT CACTTCCTGT ATATCCGG 598 
(2) INFORMATION FOR SEQ ID NO: 272: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1099 base pairs 
{B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY : linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 272: 

CCAGCAAATC AATAACTGCA ATTGCTATAA AATGGATTCT ATAGAGTTTT TTCATGACAA 60 

GACCTCCCTC TTTTATCTAA CTTCATTCTA CTCCAAAAGA ATGGGAGTTA CAACTAAAAT 120 

GATAAAAATA GCAGAAGGGA GATTCTCTTA AGTTGGCTAG TATTCTTTAT TTGAGTTTCC 180 

TTCTATTATC TAACTTCTTC ATCATTCCAG ACAAATAAAG CTCCGATTGC ATTGAGGATA 240 

TAAAAGATGT ATTTACCGAT ATTGGCGAAG TTTCCTTGAA TACCAGCTTT TGTCAGCTGA 3 00 

ACGAAATTGT AAATCAACCA AAAGCCCCAC TGAGTTGTTA GTTTTAATGC ATTCAAAGCA 3 60 

TTGGCAATGA GGGACAGTGC AAAGGCAATA GTTGTTACGT AGGCAAGGAG ATTCATCTTG 420 

CCCCCATATC CGATATAGTT GGTCACAAAG GCAAAGAGGA AGGCGATGAT GGAAATGATG 480 

ATGGCCGCCA ATTTTACCTG TTTTTGGCTC ATTTGGTTGG GTCTGCCTTC TTGCGAAGCT 540 

TCCCACTTCT TTATAGCAAA GGTATAAATG AGGAAGGTGA CGGGATAGGT AATGATGGCC 600 

GCCTTATTTC CAAGGATATA ATCAATAGCA CCGGACAAAA TGGTATTAAC AATACCAAAG 6 60 

TAATTTCCCC ATTTGCTTAA TTTCCCCGTG AAACGAGTGG ACAACATGGA AATCCCAACG 720 

TTGGTTACGG AAATCAATCC AAAGGGTACA AGAGCTGTCC ATGATCCCCA GTCTACAAAT 780 

TTATCGAGGT GTGAGTTGAG GTAACCAGAT GCAATCGCAA TCCCAACGAC CAAAGCAACC 840 

CCGAAGAGGT CAAACTATTT AGATGTAGCA AAAATTTTTA GTGATTTTTT CATAGGTTAA 900 

ACTACCTTTC TTTTTTTCAA ATATTCTCCC ACCAAATGAA AGTAAAATAA AATGATAGAA 960 

ATAAAACCCT GAAAATAAAG GTTCTATAAT ATTTGTAGTG GGTAAATCCA CTATAGATAT 102 0 

TATGGAGCCT ATTTTATTGT AGAAAAAAAG TCCCATATGA CCTATAATGA AAAGCGACAA 1080 

AACAACTCAT TAGAAAGAT 1099 
(2) INFORMATION FOR SEQ ID NO: 273: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2723 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 273: 
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CTGGGATTCA CGTGAAAAGG AAGCCCAGAG AGTAGCCAGG TGTACTGCTA GAACAGTGAG 60 

TGAAATTGAA TATTACCATA GAGAGTCAAC CCAGATAGCT CAGGCTTTAG TTGAAAATCA 120 

AGCTCGTATC GAGGGAATCT ATAAATACTT TAGCCTTAGC ATGCCAGACT ATTTTTACTG 18 0 

GCAATTAGAG CGGAAAGCTT CGCCTTATAT ATCAGTCTCT CTGTATGAAA ATGTTGATGA 240 

CCTCTATGTT CGAAATGATT TTGTAACTGG GGTGGCCATT GCTTTTCAAG ATTACAAGGA 300 

AGTCTATGTT TCTACTAAAG ACAAACGTAG GkkAGAAAAA ATCAGGGCTG AGGATTTCAA 360 

ACCAGCAGGA AATAGTTTTG CCATTCCAGT GTCAGATCCA GTGTCAGATC AAGACTTAGG 420 

AGTGATTTAC ATCTCCTTGG ATCCTGCTGT TTTATACCAT GCCATTGATA ATACTAGAGG 480 

TCATACTCCG ATGGCAGTAA CAGTGACCTC ACCTTTTGAT ACGGAGATTT TTCATATGGG 540 

TGAGACAGTT GATAAGGAGA GTGAAAATTG GCTAGTTGGC TTAACTTCTC ATGGATATCA 600 

GGTTCAGGTG GCAGTTCCTA AAAACTTTGT TTTACAAGGA ACAGTGACTA GCTCTGCTTT 660 

GATTGTGGGT TTGAGCCTTC TCTTTATTGT CATTCTTTAT CTGACTTTGA GGCAGACTTT 720 

TGCTAATTAC CAAAAGCAGG TAGTGGATTT AGTAGAATCC ATTCAAGTCA TTGCTCAAGG 780 

CGAAGAGGGG CGTCGGATTG ACATTTCCGA GAAAGATCAG GAATTACTCC TAATCGCGGA 840 

GACGACCAAT GATATGTTGG ATCGATTGGA AAAGAATATC CATGATATTT ACCAGTTAGA 900 

GCTTAGTCAA AAAGATGCCA ATATGCGAGC CTTGCAGGCG CAAATCAATC CTCATTTTAT 9 60 

GTATAATACG CTGGAGTTCT TGCGCATGTA TGCAGTTATG CAGAGTCAAG ATGAGTTGGC 1020 

AGATATCATT TATGAATTCA GTAGTCTCTT GCGTAACAAT ATTTCCGACG AAAGAGAGAC 1080 

CCTCCTCAAA CAGGAATTAG AATTTTGCCG TAAATACAGC TATCTCTGCA TGGTTCGCTA 1140 

TCCCAAGTCC ATTGCCTATG GTTTCAAGAT AGATCCAGAG TTAGAGAATA TGAAGATTCC 12 00 

CAAGTTTACC TTGCAACCGC TGGTAGAAAA CTATTTCGCG CATGGTGTTG ACCACAGGCG 1260 

GACAGATAAT GTGATTAGCA TCAAGGCTCT TAAACAGGAT GGTTTTGTGG AAATTTTGGT 13 20 

GGTCGATAAT GGTAGAGGAA TGTCGGCTGA AAAGTTGGCA AATATCCGAG AAAAATTAAG 13 80 

TCAGAGATAT TTTGAACACC AAGCCAGCTA CAGTGATCAA AGGCAGTCTA TCGGGATTGT 1440 

CAATGTACAC GAGCGTTTTG TGCTCTATTT TGGAGACCGC TATGCCATTA CTATAGAGTC 1500 

TGCAGAGCAA GCCGGTGTTC AGTATCGTAT TACAATTCAA GATGAGTAGA AAGGGAGAAA 15 60 

ATGTATAAAG TATTATTAGT AGATGATGAG TACATGGTGA CAGAAGGTCT GAAGCGTTTG 1620 

ATTCCCTTTG ATAAGTGGGA TATGGAGGTC GTCGCAACAG CCAGTCATGC CGATGAAGCT 1680 

CTAGAATATG TTCAGGAAAA TCCTGTCGAT GTCATCATTT CCGATGTCAA TATGCCAGAC 1740 
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AAAACAGGGC TTGATATGAT TCGGGAGATG AAAGAGATCT TACCAGATGC TGCCTATATC 1800 

CTGCTCTCAG GTTATCAGGA GTTTGATTAT GTAAAAAGAG CAATGAACCT TAGTGTGGTG 1860 

GACTATTTGG TCAAGCCTGT TGATAAGGTA GAGCTGGGAA ATCTGCTGGA GAAGATTGCA 1920 

GGTCAGCTCG GCGAGAGAGG GAAGAAAAGT CAGACTCTTA GTCAAGAATT AGACGAGGCT 1980 

GGATTTGTTA GTTATTTAGG GGATAAGGAG AATTGGTGGA TAGGTCTATC CAAGGAAAAA 2 040 

CAAGGTTCCT TCACCATTCC CTACTATGTC TTGGGTCAAG ACTGGCAGAT TTTCATTTCT 2100 

GGCCACCCCC TAGATGGTTT AGTCGTTACA CCTTTTGAAG CTCCTTATCA AGAACACTTT 2160 

GAACGCTGGA AGCTGAATGC TGAGAAAACC CTCTTTTACG GTTCTGTAAA TCTGCAGCAG 2220 

TCTGAGAGTC TCTTTGCCTA TTACGAACCG ATTTATAGGG TTATCATTCA GGGAAATCTC 2 2 80 

AATCAAATCG TAGAAGAGTT AAATCTCTTG GAGAAGGTAG TTCTTGAAAA TACACCTCGT 2340 

GTTTCGATTA CTAAACAGCT TTTTATCCAG TTTGTCATGG ATGTTTTCCA TTTATTTGAA 2400 

CATCTCAAAG CTGATGATAT GACGGACATT GTCAAAACCA TTCATGCTAT TCAATCCTTC 2 4 60 

GATGAATTGG TTTCTTATAT CAAGGAAACT CTGATCAGCT TTTTCGGTCA ATACCGTATG 2520 

AATGAAAATG TGGTCAGTGT GCTGGAAGTC ATTGGTCGTG ATTACCAAAA AGAGCTTTCC 2 580 

CTCAAGGATA TCAGTAAGGC CCTCTTTATC AATCCTGTCT ATCTAGGGCA GTTGATTAAG 2640 

CGTGAAACCG ATTCGACCTT TGCAGAGTTA CTAAACAAAC AACGTATTAA GGCTGCCCAG 27 00 

CAGCTCTTGC TTTCAACTAG TGA 2723 
(2) INFORMATION FOR SEQ ID NO: 274: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 836 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 274: 

CCGCAGTTTT TTTAAACCGT ATATAAGTAT AGCATAGTCA AAAAAAGAAT GCAAGATTTT 60 

TGCAAACTTT TTTAAAATTT TTCGTAATTT TTCTTTTAAA GTTCTACTGT CAGGACTTGA 120 

CCTTGCTTAA CAACCTGTTC TCCGGCGATA TAAACATCAT CTACATCACT AGATTTAACT 180 

GCATAAACCA GGTGAGACAG CATATTTTCC TGAGGTTGGA GATGAATTTT CCCTTGTGGT 240 

TGAATGACCA GAAAATCTGC TTGCTTGCCG ACTTCCAGAC TTCCTATCTG ATTTTCCATT 300 

CCAAGGACCT TAGCCCCTTC GATTGTCAGT ACCTTGAGAG CTGTTTCGAT TGGAAACTGG 3 60 

CTGGCATCCC CACTTTTCAT CTTCTGAAGA AGAGCTGCAG TCCTTCCTTC CTCAAACATA 420 
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TCTAGATTGT TATTGGAAGC AACCGAGTCA GTCGCAATTC CGACTGCTAC TCCCGCTTTT 480 

TGGAGCTGGA TAATTGGAGC AATTCCTGAT GCCAGTTTGA GGTTACTGAT AGGATTGTGG 540 

GCGATAGCnA CTTGAGAAGA TGCCAAGCGT TCAATTTCTC TCTCGTTTAA TTCGACCCCG 600 

TGAGCAAATA CGGACGGATG ATCTAAATAA CCCAGTTCTT CAAGAAAAGC AAGGGGGCGT 660 

TTGCCGTATC GTTTGAGGAT AATTCCTGAC TCCTCCTTGG TCTCCGCCAC ATGGACATGG 720 

AGCGGAATAT TTAGCTCTTT TGCCATTTCC AAACTCGCTT CCAGCAAGTC TCTACTGCAG 780 

CTATACGGAG AATGAGGTGC TACCATAACC TTGAAATTTG GATTTTTATA TTTTAA 83 6 
(2) INFORMATION FOR SEQ ID NO: 275: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2335 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 275: 

ATTTTATTTC ACTTTTTAGG TGGTCTGGGG CTATTCTTAT ATAGCnTCAA GACCATGGGA 60 

GACGGTTTAC AACAAGCTGC TGGAGATCGC CTGGGTTTTT ACATTGACAA ATATACTAGT 120 

AATCCTTTGT TTGGAGTTCT GGTTGGTATT GGGATGACTG CTCTAATTCA GTCTAGTTCT 180 

GGTGTAACAG TTATCACAGT CGGCCTGGTC AGTGCCGGTC TCTTAACCTT ACGTCAGGCT 240 

ATCGGGATTG TCATGGGTGC TAATATTGGG ACAACTGTCA CATCCTTTCT CATCGGTTTT 3 00 

AAATTAGGTA ACTATGCCCT ACCTATGCTC TTTATCGGTG CCGTCTGTCT TTTTTTTACG 360 

AAAAATCGGA CAGTCAATAA TATCGGACGC ATCCTCTTTG GTGTCGGTGG TATCTTTTTT 42 0 

GCCCTCAATC TCATGAGCGG CGCAATGGCT CCACTCAAGG ATTTACAGGT CTTTAAGGAC 4 80 

TATATGATTG AGCTAAGTAA GAATCCTGTT TTGGGTGTCT TTGTCGGTAC TGGCTTGACC 540 

TTGCTAATTC AAGCTTCTTC GGCTACCATT GGGATTTTAC AAAACCTCTA CGCCGGCAAT 600 

CTAATTGATC TACAGGGAGC TTTGCCAGTT CTATTTGGTG ACAATATCGG GACAACCATT 660 

ACAGCCATCA TTGCCTCTTT AGGGGCTAAT ATTGCAGCTA AACGGGTAGC AGGAGCTCAT 720 

GTTGCCTTCA ACGTTATCGG AACAGTTGTC TGCGTTATTT TTCTAGTTCC TTTTACTGTC 780 

CTGATTCATT GGTTTGAAGC TACGCTAAAT CTAGCACCGG AAATGACCAT CGCCTTTGCT 840 

CACGGAACCT TTAATATTAC CAACACCATT GTCCAATTTC CATTTATCGG AGCTCTGGCT 900 

TACTTTGTAA CCAAGATTAT TCCTGGAGAG GACGAGGTTG TCAAATACGA ACCCTTATAT 960 
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CTCTTGCACT TAGGAAACTA CGCTGCTAAA GCCTTTGACC TTTCCTATAA GTACATCATT 1080 

GACTTGGATG AAAAAGTTGC TGAAAAAGGG CATAAAACCG AAGAAGCAAT TAACACCATC 1140 

GATGAGCAAT TAACACGTTA TCTCATTGCC CTTTCAAGCG AAGCTCTCAG CCAAAAAGAA 12 00 

AGTGAAGTGC TTACCAATAT CCTTGATTCC TCCCGTGATT TGGAACGGAT TGGAGACCAC 12 60 

ACGGAGGCTC TACTCAATCT GACTGACTAT CTTCAACGGA AAAATGTTGA ATTTTCTGAT 132 0 

GCCGCCTTGA AAGAATTAGA GGAAGTTTAC CGCCAAACTA GTGACTTTAT CAAAGATGCT 1380 

CTGGATAGTG TGGAAAACAA TGATATTGAA AAAGCACGCA GTCTTGTAGA ACGTCATGAA 1440 

GCAATCAATA AGATAGAACG TGTTCTCAGA AAAACCCACA TCAAACGCCT CAACAAAGGC 1500 

GAATGTTCAA CACAAGCTGG GGTCAACTTT ATCGACATCA TCTCACACTA CACTCGTGTA 1560 

TCAGACCACG CTATGAACCT TGCTGAAAAG GTTTTTGCAG AACAAATCTA AGAACCAAGA 162 0 

AGCTATCCAT CATAATTGGA TGGCTTTTTA CTTTTTCCTA AGCAAGACTA GGATGAATGA 1680 

AACTGAAAGA GTATTCTGCA GATATATAGT CCCCAATTAT TCACCCCAAA TCTAAAAACC 174 0 

ATCCAGAATC CTTGCCTTAG CTTAGATCCT GGATGGTTTC TTTTTTCACC CAATGGGTGT 1800 

TTTTTACTAG ACAAAAAAGA GTTTCCCCTT TATGGTATAA GTGTAGAAAA AAACACAAAA 1860 

AGAAAGGAAA CTCACATGAA CAGTTTACCA AATCATCACT TCCAAAACAA GTCTTTTTAC 192 0 

CAACTATCTT TCGATGGAGG TCATTTAACC CAGTATGGTG GTCTTATCTT TTTTCAGGAA 1980 

CTTTTTTCCC AGTTGAAACT AAAAGAGCGG ATTTCTAAGT ATTTAGTAAC GAATGACCAA 2040 

CGCCGCTACT GTCGTTATTC GGATTCAGAT ATCCTTGTCC AGTTCCTCTT TCAACTGTTA 2100 

ACAGGTTATG GAACGGACTA TGCTTGTAAA GAATTGTCAG CTGATGCCTA CTTTCCAAAA 2160 

TTGTTGGAAG GAGGGCAGCT TGCTTCACAG CCAACCTTAT CCCGTTTTCT TTCCAGAACT 2220 

GACGAGGAAA CAGTCCATAG TTTGCGATGC CTCAACCTTG AATgGkCGAA TTCTTTTTAc 2280 

AGTTTCACCA GCTAAACCAA CTCATTGTAG ATATCGATTC TACCCATTTC ACAAC 2335 
(2) INFORMATION FOR SEQ ID NO: 276: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 752 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 276: 

CGGATTCACT GTTGTTGACT AATCAATAAC ACAGTAGAAA ATCTCACAGC AGTCTATTAG 60 
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TTGCTTTTCA TACTAGGCAA GTGACTGAGG CTTGTACTTG GGTACAGCAA GGGAGCTTAA 120 

GGCCGTAGAA GAGAAAAATA GTAGACTGAA AACCCGCAAG ACTTCATCAT TTCGAGAAGT 180 

GACGTGGGAG ATGAAAATCG ATTGAACCAC TTACAAGGAG AATAGAAAAT GGCTAAAAAA 240 

AGCAAACAAC TTCGTGCTGC TCTTGAGAAA ATCGACAGCA CAAAAGCATA CAGTGTAGAA 3 00 

GAAGCTGTAG CACTTGCAAA AGAAACTAAC TTTGCAAAAT TTGATGCAAC TGTAGAAGTT 3 60 

GCTTACAACT TGAACATCGA CGTTAAAAAA GCTGACCAAC AAATCCGTGG AGCAATGGTA 42 0 

TTGCCAAACG GTACTGGTAA AACTTCACGT GTTCTTGTTT TCGCACGTGG TGCAAAAGCT 480 

GAAGAAGCAA AAGCTGCTGG TGCAGACTTT GTTGGTGAAG ATGACCTTGT TGCTAAAATC 54 0 

AACGACGGTT GGTTGGACTT CGACGTAGtT ATCGCTACAC CTGATATGAT GGCTCTTGTT 600 

GGACGTCTTG GACGTGTCCT TGGACCACGT AACTTGATGC CAAACCCTAA AACTGGTACT 660 

GTAACAATGG ATGTTGGCAA AGCGGTTGAA GAGTCTAAAG GTGGTAAAAT CACTTACCGT 72 0 

GCTGACCGTG CAGGTAACGT TCAAGCAATC AT 7 52 
(2) INFORMATION FOR SEQ ID NO: 277: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 2643 base pairs 

(B) TYPE : nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY : linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 277: 

GTCAACATTG ATTTCAAGGC TGTTTGCTTT CTATCTCCCC TTTTTCATAA TGTATAATAA 60 

AATGAAATAA TAACAGGACG AATTGATCGG GACAGTCAAA TCGATTTCTA ACAATGTTTT 120 

AGAAGTAGAG GTGTACTATT CTAGTTTCAA TCTACTATAT TTTCGTACAG GTGCTTCAAC 18 0 

CATTTGAACG ATTTCAAATC CTTCTTTTTG GTAAAGATTC TGAGCTCTTT GATTTGCCTC 240 

GAAGACATTT AGAGAAATAC TGTCTATATC TCTATTTTCA AATGCTAAAC TAACAAATTT 300 

CCTTAAAGCC TTGCTACCTA AGCCTTGCTC CTGTTTCTGG GGGTTGATAA AAAATCTCCC 3 60 

GATATGAAGA TTGCTGTCTT CTAGCCTGAT TTTCTGGATA AATCCCACAA ACTCTTGTTC 42 0 

ATCAAAGATT GAAAAGACTC CTTCCAAGGC TTGAAGTGTC AGTAGAAAAG GAATCCTTGG 4 80 

TCCCATCCAT TGTTCTTGAA AGGATTTGCC TAGGGAGTTG GACCACTGGC ATACAAATTG 540 

AGCGTTTTCT GTGCTCACCT TTTCTTCAAA ACGAATTGTC ATCTTTTCCT CACCACCTTA 600 

TCTATGTTTC TCCATTATAC TATTTCTCCC ATTTTTTACG AATAGATAAG TATGATTGAT 660 
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TTTTATTTTT TTCTCGTCGG GAGCATTCTA GCTTCCTTTC TTGGTTTGGT CATTGACCGT 72 0 

TTTCCAGAGC AATCCATTAT CAGTTCAGCC AGTCACTGCG ATTCCTGTCA GACTCCCTTG 780 

CGTCCCTTAG ATTTGATTCC GATTCTCTCA CAGGTCTTCA ATCGCTTTCG CTGTCGCTAC 84 0 

TGCAAAGTTC GCTATCCTGT CTGGTATGCC CTCTTTGAAT TAAGCTTAGG ACTCCTCTTT 900 

CTGCTTTACT CTTGGGGATG GCTCTCCTTG GGGCAAGTCG TCCTAATCAC CGCTGGTTTG 960 

ACCTTGGGTA TCTACGACTT TCACCATCAG GAATATCCCT TACTGGTCTG GATGACTTTC 1020 

CAGCTAATCC TAATAGCTTC CTCTGGCTGG AATCTGGTCA TGGTCTCCTT CCTCATACTT 1080 

GGAATTTTGG CTCATTTTAT CGATATCCGC ATGGGTGCAG GGGATTTCCT CTTTTTAGCT 1140 

TCTTGTGCTC TCGTCTTTAG CGTAACGGAG TTACTGATCT TGATTCAGTT CGCTTCTGCG 1200 

ACGGGTATCC TGGCCTTTCT CCTGCAAAAG AAAAAGGAAA GACTTCCTTT CGTGCCTTTC 1260 

CTCTTACTTG CTACTTGTTT GATTATTTTT GGTAAGCTAC TGCTTGTCTG ATAAAATCCA 132 0 

ATTTCTGCCA TATATCCTTC ATGAAATTAT TTCACAGTTA AATTATAAAT TATTTCTTTT 1380 

GTACAAAGGG ATGATGTTAT CAAATCGATC TGTTCTTCTA TCTTCTTGAT ACTGATCAAA 1440 

AAATTTCATT TCGACTGAAA ATATTTCGCT TATAAACTGT AAACGAATAC TTTGTTTAGA 1500 

CATTATAGTC GCTAGACTGA CTAGATGATT ACTCAAAACG ACGTCCAGAA TACTCTTTAC 1560 

TTTGCTTGGT TTTTTAACAA AAATTTGATC ATCCAAGGGT TCAATCATTT TGTAACCTTT 1620 

TTGCGCAATT TGACGATAAA AGTAAGAATG TTGCTTTGGA GTCAATAATC CTAACTTAAA 1680 

AGCTCGATAC TCTAAAGCCT GTATCGAAAC ATTCAAATCC GACTTCAATA AAATATAACT 1740 

ATCAGGATTG CTGACACGCT TGCCAACCCT CTCTTCAAAT TTGACTAAAA ACTCTTCTTT 1800 

TGGCAATAAA AAACATGATG CAAAATAATT TGCTTCTTGC TCCAAACGAT CGCCATCTTC 1860 

ATTCATATCT TTATATTTAT GTAAAAGAAT ATGTCCTAGC TCATGAGCTA AGTCAAAATT 1920 

TCGACGTACA GATGATTTAT TCGTTCCTAA CACAATATAA GGTCTTCCCA ATTTTGACCA 1980 

TGCGCTATAA GCATCAGCTT GGCCATTAAT TAATCGTTCC ACGATATAGA TGCCTGAACG 2040 

TTCTAATTTA TAAAGCAAAT CATGATTATC TTTTGAAATA CCTAATTTTT CCCTGGCATA ilOO 

AAGAGCCAAT TCCTCAATGG ATTCTCCCTT ATGATAAGAT TCACTCACTA CATTACTTAG 2160 

GTCATGAATT ATAATATTAG GTATAATTAC AAAACTTTCA AAATAATCAA TCAAACTATC 2 220 

TACCTTATGT AAATACATAG TTTGAATATC TATTGTTTTC CGTGTTGCTA GGTCTGCATT 2 280 

TCTAAAGGCA ATTACAGAAG AATCAAATCG AATGCTCTCT TCTTCCTGTT CAAAATAAGT 2340 

TAAATCAACA TGAAATTGGT TGGCCAAATG CATTTTGGTT GATAATTTAG GTTTCGTTTC 2400 

GTTGGACTCA AACTGCCAAA TGGCTTGTTC CGTTAAATTA ATTCTCTGAG CTAATTCTGC 2460 
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TCTACTTAAA CCATTTAACA GCCGTAATTC TTTCAATACC CGACCATTAA AC AT T T AC AT 252 0 

ACTCCTTACT ACTTTTGACC TTCTTGTTTT TCTATTCTTG GAATAATTTC AAAATCTTCT 2580 

GTTTCCGATA ATTCTGAAAA ATTAGGAATA TCTTGATATT TAGCTTCTTC GAAATGGTAC 2 640 

GGG 2643 
(2) INFORMATION FOR SEQ ID NO : 27 8: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 582 base pairs 

(B) TYPE : nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 278: 

TGACCAGTGG CAAAATGGCT ATCCAAATGC AGATGTTATT ATCGATGATA TCATCTCAGG 60 

GCAAGCCTAC GTAGCCTTGG AAGAGGGAGA ACTGCTAGCC TATCCTGCTG TGACCAAGAG 12 0 

TCCAGAGGAG GCCTATGAAG CTATTTATGA GGGAAACTGG CAAGCTGGAG AGTCAGAGTA 180 

TCTAGTCTTT CACCGTATTG CTGTGGCAGC AGATGTGCAG GGAAaAGGAG TTGCTCAAAC 24 0 

CTTCTTAGAG GGCTTGATTG AAGGTTTTGA TTATCTTGAT TTTCGCTCAG ATACGCATGC 300 

TGAAAACAAG GTTATGCAAC ATATTTTTGA AAAACTTGGT TTTAAACAAG TCGGTAAGAT 360 

GCCAGTAGAT GGCGAACGCT TGGCCTATCA AAAATTAAAG AAATAATGCA AAAGAAGTAT 420 

GTAAAAATCC TCTACTCCTC ACCAATTGGT ATTCTATCAC TTGTAGCTGA TGACCATTAT 480 

TTGTATGGAA TTTGGGTTCA GGAGCAGAAG CATTTTGAGA GGGGACTAGG AGATGAAACG 54 0 

ATAGAAGAAG TTGTwAGTCA TCCTATTTTA GACCCAGTTA TT 582 
(2) INFORMATION FOR SEQ ID NO: 279: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 554 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 
(□) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 279: 

CCCAAGCTAC TAAGAGACTA AAACTTGCTA GAGAAGCAAG AGAAAGTGTG AATCTTTTTA 60 

ATTTCATGAT GAATTTCCTT TCTGCTACCA ATTTAGAGAA ATTTTCTCTA ACCAGCAATT 12 0 

CCCCTAGTAT AACAAGTTCA AAAAATGGAG TCAATTTATC TGCTCACGGT CCAGCAGGTA 18 0 
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GCCCCGTACT TCTGAGATAA AATAGAGAGA CCCTGTAACG AACAGCAAGT CTTGAGCGTC 2 40 

TGCCCTTTCT TCAAAATCGC TGATAAATTC TCGGTAAGAA GAAACTATAT CGTAACCTGT 300 

CACATCCCTT TCGTCCAAAG CCCCCTGATA GTCAAAGCCG GTCACCTTGA GTTCCACCTG 3 60 

AGGCAATTTT TCAGTCAGAT AACCCAACAT CCCTTGATAA TCCTTACGTT TCAAGGATCC 420 

AAAGAGGATT TGAGGTCGAT AGCCTTCCTG CTCTTTTTCT TTGATAAACT CAGCCAAGCG 480 

AGTCAAGGCA GGGAGGTTAT GAGCACCATC CAAATAAATC TGTGGGCGAA TACGCTCCAA 540 

GCGAsCAGCC CAAT 554 
(2) INFORMATION FOR SEQ ID NO: 280: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 766 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 280: 

CCGGTTTTTC AAATGAATTT CTTGGTTGTG GCTAAAAAAT ATGCTACACT ATCAATATGA 60 

AAATTTTAAT CCCAACAGCA AAAGAAATGA ACACAGACTT CCCAAGTATC GAGGCAATTC 12 0 

CTTTAAAACC AGAAAGTCAG GCCGTGCTTG ATGCCTTGGC TCTCTATTCT GCCAGTCAAT 180 

TGGAGAGTTT CTACAAGGTA TCAGCTGAGA AAGCGGCGGA AGAATTTCAA AATATCCAAG 240 

CTTTGAAAAG GCAAACTGCT CAACACTATC CAGCCTTGAA ACTTTTTGAT GGGCTTATGT 300 

ACCGCAACAT TAAGAGAGAT AAGCTGACCG AGGCGGAACA AGATTATCTT GAAAATCATG 360 

TTTTCATTAC CTCGGCTTTG TACGGTGTTG TTCCAGTCTT GTCACCCATG GCTCCTCACC 420 

GTTTGGATTT TTTGATGAAA TTAAAAGTCG CTGGTAAGAC TTTGAAGAGC CATTGGAAGG 48 0 

CAGCCTATGA TGAAACTCTG AAGAAGGAAG AAGTGATTTT CTCTCTCTTG TCATCAGAGT 540 

TTGAGACTGT ATTTTCTAAG GAAATCAGAG CAAAGATGGT GACCTTCAAA TTCATGGAGG 600 

ATAGAGGCGG TCAGCTGAAG ATTCACTCAA CTATCTCCAA GAAAGCGCGC GGGGCCTTTC 660 

TAACAGCTTT AATAGAAAAT CAAGTACAAA CTGTGGGGGa AGCACGTCGC TTGAACTTTG 720 

CTGGATTTGT TTACCGAGAA GATTTGTCAC AACCACAGGG GGATGG 766 
(2) INFORMATION FOR SEQ ID NO: 281: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 901 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 
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<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 281: 

CCGGCCACGG TTCCATCCAA CTTCACAGGT GTGCACTTGA TTGTGTATGT AATTGTCACT 60 

AACGGTAGAA TTTCACCTAT CCCTCCTATC TGCTCGCAGT ACCCGCAGAC TTTCTGAAAG 120 

AAGAAGATAA CCTACTTATC CGTTGCTATG ATTATACTAA AGTTTCTACT TTTTTGCAAA 180 

TAGATTTTTA AATTTTTGGC TAATTGTCTG AATCAGGGTC GGAAGTTTGA CGACCTTGTC 240 

ATTGCCTAGT TTTTCGCGTG CAATTTTGAG AATGGCACCT GAGTCTTTTG AAGCAAAGAG 300 

GAATTTTCCT TTGTCTGTAA AGACTTCGAA GTGGCGGCTG ATTTTGCGTC CAGTGACATT 3 60 

GGCTCCAATC TGATTGATAT GGCTCCAAGG AATCTGGATA AATTGTTCGA CATTGACATC 420 

TGGGTAAAAT TCCAAAGCCT GATCTCCGAC AAGGAATTTC CCAACTTTCC CAGCGATAGA 480 

GAGGTAGGAA GTGCCTGTCG TACTGAGGAG TACTGTTTTG TTAAGTGATT GGGCCATGCT 540 

TAGTCTTCCT TACTTTCTCC AAAAAAGGCA TTGTAGAGGG CTTTAATTGC TGCTTTCTCT 600 

TGGTCTTTAT TGACAACAAA CATAATAGAA ACTTCACTAG AACCTTGAGA CATCATCTGG 660 

ATGTTGATTT TGTTTTCAGA TAGAGCGCGT GTCGCAGTAG CAGTCACTCC GATATGGCTC 720 

TTCATTTTTT CACCAACAAT CATAATGATA GAAAGGTCGT GTTCGATTTC TGCATGATCT 780 

ACTTTAGCCT TTTGAACCAA CTGACGCAGG ATTTCTTCTT CCTTGATGGG AGTTAGTTGG 84 0 

CGAGAACGGA GAATGATAGA AAGAwCGTCG ATACCTGTTG GCATATGTTC CCAACCGATG 900 

T 901 
(2) INFORMATION FOR SEQ ID NO: 282: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1765 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY : linear 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 282: 

CCCTGTTACG TGGATAATAG GGTAAGACTG CTCAGGATTT CCTAACAAAT CCACCGCTTG 60 

CTGCATTCGA CCCAAACCTG AT CG AAAAT T CAAACCAATC CGACTATGGA GCCATTCTTC 120 

TACTTCAAAC AT AC AC AT CT CCTTGACAAA AGTCCAATCA ATTATCGCAT TAAAGTATGG 180 

TTACTAATAA AAACAAGGCC AGGATTTTCG TCCCGACCTC TTACCTGGTT AGCTAATAAC 240 

TAGCTACTAT GAATGTGAAT ATGGGCTAAA AACATCCACT GGACGTTCCA ACTCTTCCCC 300 
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ATTTCTGGGA GTTGGGGTAA AAATGTTCAC TGGACGTTCC AACTCTTCCC CATTTCTGGG 3 60 

AGTTGGGCTG ATACAGTCTC CCAGACTGTA TCACTCCTCC ATAAAGCTGT TGAAGACTTC 420 

TTCAATCATG TTCCATTCGT CTTCTGAGTC TTCTGGGATT GGTTGCAATT CGCCTTCTGT 480 

TCCATCTTCG TTTTCGATGA ATGAGTAAGC TTGGATTTCA ACTTGTCCGT CTTCGTCTTC 540 

TTCTGCGTTA ACTGGTACTA GAAGAACATA GTTTTTACCA AATTCTTCTT TTCCATCAAT 600 

TGTCAAAAGG ATTTCAAACA AGGTTTCATT TCCTTGCTCA TCTACTAGTG TGATTAGTTC 660 

ACGTTCTTCG TGGTCGTGGT TATGATCGTG TGACATAGCC TCGCCTTTAT ATTAAAATTT 720 

TCTATCTAAA TAATTTTGTA AAATCAGCTG AGCTGCTAAC TTATCAATGA CTTTCTTGCG 780 

CTTATTGCGA CTGATATCTG CTTGTTCAAT CAACATGCGC TCAGCAGCCA CTGTTGTCAA 840 

GCGTTCATCC TGATAGTCTA CTGGTAAACC AAAAAACTCT TCTAGCTTTG CTCCGTAGCT 900 

TGACTAGCTT CTACGCGCGG TCCACTTGTA TTGTTCATGT TTTTAGGCAA GCCCACTACA 960 

AATCGTTCCA CCTTGTAAGT ATCAACCAAT TCCTTAACGC GGTCAAAACC AAATTGGCCT 1020 

TGTTCTTCAT TTATCTGGAT GATTTCAAGC CCTTGAGCTG TAAAACCAAG CGGATCGCTA 1080 

ATCGCCACCC CTACCGTTTT TGAACCGACG TCCAATCCCA TAATTCTCAT AGGTTATAGA 1140 

TCGACTCCTT GTCCTTTGAG GTAGTAGCGA ACCAATTCCT CAACGATTTC ATCACGCTCA 1200 

TACTTACGGA TTTGATTTCG TGCATTATTA TAACGAGGAA CGTAGGCAGG GTCTCCACTC 1260 

AATACGTAAC CTACGATTTG GTTAATTGGG TTGTAaCCCT TATCGTTCAA CGAAGCATAA 1320 

ACATCTGTCA AAGTTTCGCT AATTTCTTTT TTATTGGAAT CGTCCAATTT AAAACGTACT 1380 

GTTTCTTCAG TAAATCCCAT TCTAACACCC TCTTTCCTTA GAATAGTACC ATTATAGCAT 1440 

AATTCCTTAC CTTCTACAAT TCAGGCAGTC TATTTATTTG GATTTTCTAT TGTTCTGTCG 1500 

CGCCATTTGC CAATCTATCT GAAATATATT TGCTTGGTTC ATTTTTCAAA AGATTTTCCA 1560 

AACCAATATT CTTCAGATGT TCCAACTGGG AAGCCTTCTT GACATCCAGA ACTTGAAAAT 1620 

CAAAACTAGT CGTTGTTTGA AGTTCCGTTG CGCTCAATAG TTTTGTTTCA AGTTTGAAAC 1680 

CTGCCAATTT ACGAGCTTCA ATGATAGACT TATCCTTCTC CTCCGCTTCA AGAAGAGCTT 1740 

TTTGAGTTTC CTCCACTCCA TGTTG 17 65 
(2] INFORMATION FOR SEQ ID NO: 283: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1346 base pairs 

(B) TYPE : nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 283: 

CTTATCCATT CACTTTCTTG TCTGTTATTC TATAAATCTT ACTCCTAAGT ATACCACATT 60 

TGCCCCTAGA TGTGAACGAG AGAAACGCTC TAGACATTGC CAAGAAGGAA AAAAAAGGGT 12 0 

ACAATGTAAC AAAATCAAGG GAGGTCTGGA ATGAAGAAAC AAAGCAAGTA CAAAGAGGTC 180 

GTTTCCTATC TGAAAAATGG TATCGAGTCT GGACGATTTC CGACGGGTAG TCGCCTGCCT 24 0 

TCTATCCGTC AACTGAGCCT TGACTTTCAC TGCAGCAAGG ACACCATTCA ACGAGCCCTG 3 00 

CTGGAATTAC GGCACGAACA ATACCTCTAT GCCAAGCCTC AGAGTGGCTA CTATGTATTA 3 60 

GAACAAGGGC AACATCAAGA CCTAGAAATC GAGGTTACCG ACGAACATGC CAGTGCCTAT 42 0 

GACGATTTCC GACTCTGTGT CAATGAAACC TTGATTGGCC GAGAAAACTA CCTCTTCAAC 480 

TACTATGACA ATCAAGAAGG ATTAGAAGAC CTAAGACAGT CCATTCACAA ACTCCTCTTT 54 0 

GAGCAAGCTC TCTACTGCAA GGCTAACCAA CTAGTACTGA CTTCTGGAAC CCAACAAGCC 600 

TTGTTTATCC TCTCTCAAAT ATCCTTTCCT AGACAAGCCA AGGAAATCTT GGTGGAACAG 660 

CCAACCTACC ATCGGATGAA TCGCCTCTTG ATTGCACAGG GGCTGGACTA TCAAACGATT 72 0 

GAACGAGGCA TTGATGGGAT TGACTTGGAG GAGCTGGAAG GCCACTTCAA AACAGGAAAA 78 0 

ATTAAGTTTT TCTACACCAT TCCCCGATTT CACTATCCCC TGGGACATTC CTATTCTGAG 84 0 

CAAGACAAAC GATCTATTCT TAACTTAGCT GCCAAGTATG ATGTCTATAT CGTAGAGGAC 90 0 

GATTATCTGG GTGATTTGGA CTCCAAGAAG GGCCAAACCT TCCACTATCT TGATACAGAG 960 

GAGCGTGTCA TTTATATCAA GTCCTTCTCG ACCAGCCTTT TTCCTGCCCT TCGTATTACA 1020 

GCACTCATTC TTCCAAATGC TATCAAAGAA GCATTTGTGG CCTACAAAAA TATCCTAGAC 108 0 

TACGACAGCA ACCTCATTAT GCAAAAGGCC CTGTCACTCT ATATTGACAG TCAATTGTTT 1140 

GAAAAAAATC GTTTGGCTCG CTTGACCAAT CATGAATCTT ACCAAAAACA AATCGAGGAA 12 0 0 

AGGATAACTA AAACACCTTG TCCCCTTCCT CATTATTCCC TACACGATGG yTTATTGCTA 1260 

GACCTGAGAC AGTATCCTAA AATCGCCAGT CTCAAACACA GTCAACTGGG cTTGGACTTC 1320 

TTTGAAGAGG CCTATTTAAG CACCTG 1346 
(2) INFORMATION FOR SEQ ID NO: 284: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 900 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



WO 98/18931 



PCT/US97/19588 



1314 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 284: 

CTATATTCAG AATATGCCAA AAATTCGGAA TGGTATAAAT TTGCGGAGGG TTCATTTGAC 60 

ATATTTAGAA AACTCCCCCA AAGAATTAAT TTTAAGAAAG ATTTTTCTAG AATTTTGGCC 120 

CCCTTTATTA TTAATTTGCT TAAATTAATC AATAATTATC TAGAGAATAA AGAATACGAG 180 

TGGATTGACA AGAATGGAAA TATTTTTTCC TCTCTAGTAT TTTATTTAGA AGATTTAATC 2 40 

TATCCTTGGA TTGTTAAACC TTTGGTTTTA GAGATAAATT CATTGCGTGA AAAAGGTTTA 3 00 

CTTGAAGGGG AATCGGAGCA GCAACGGTAC AAATATTTTA TAACATTGTT TGACAAGGAA 3 60 

GAGAATATAT TAAATTTTTA TAACAAATAT CCCGTTTTAC TGAGGCAAAT ATCGGAGTCT 42 0 

TGTCTTCGGT TCTATACTTA TTTTATAGAA ATTTTATCAA ATTTAGAAAA TGATTTTAGT 480 

GTGCTAGAAG AAGAATTAGG GCTAAGGGGG AAATTAAATG ATATAAAATT TGGAAAGGGT 54 0 

GATACACACA GCCAAGGAAA AACTGTTTTG ATACTCTTCT TTGATGACGC GAAAATTGTT 600 

TACAAGCCTA AAAATTTAAT AATCAATAAC TCACTAAATA CTATTGCTGA GTATATCCGA 660 

AAGGTTGATG AAAAAATTAG GATAAGAATA CCTCGAACTA TTGCTTATTC GGATCACAGC 72 0 

TATGAAGAAT TTATTGATTA TCTACCTCTA GAGCAAAAGA AAAATTTACC TGAATATTAT 780 

TATAATTTTG GTGTGCTTTT AGCATTTATA TATTTATTTA ATGGGAGTGA TATACATTTT 840 

GAAAATTTAA TTTCCTATGG AGATATGCCT GTAATAATAG ACTTTGAAAC AATGTTACGG 900 
(2) INFORMATION FOR SEQ ID NO: 285: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 862 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY : linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 285: 

TTATTTAGCA GAGGCAGTTT TAAATGTGAA GGATTTGGTC AGTCAAACAG TTTTTTATLA 60 

GCAGATTATT GGTTTAGAAA TCCTATCTCA AACGGATACA GAGGTCGTTC TGGGACTTGG 12 0 

AGGAAAAGCC TTGGTACACT TGATTCAAGC ACAAGAGGGT GGAGAAGTAA GGGAACATTA 180 

TGGTCTTTAC CATCTGGCTA TTCTTTTGCC GACACGAAAG GCTTTGGCGG ATGTCTTGAA 240 

GCACCTGACG GATTTACAGA TTCCTCTTGT TGGCGGTGCA GATCACGGTT ACAGTGAGGC 300 

CCTTTACTTA GAGGACTTGG AGGGAAATGG CATTGAACTC TATCGAGATA AGCCAGTTTC 3 60 

CACATGGGAT ATTCGAGAAG ATGGACGTAT TATCGGGGTG ACTGAAGTCC TTGCGGCTCA 42 0 

GGATATCTAT GAGTTGGGGG AAAGAGTAGA GCCTTTTATC CTAGCAGAGG GTACGAGAAT 480 
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GGGGCATATT CATCTTTCTG TCAAGGATAG TCGAAAGTCC AGACAGTTTT ATCAAACGGT 540 

GTTAGGGCTC GAGGATAAAT TCAGTGTGCC TAGTGCTAGT TGGATCGCAG CTGGGGACTA 600 

CCATCATCAT TTAGCAGTCA ACGAATGGGG AGGAAAAGGT CTGGATCCGC GTAAACAAGT 660 

CCTACCAGGT TTAGCCTACT ATGTCATCGA AGTCGCACAT AAAGAAGAAC TGTTAACGAT 72 0 

TGCCCAACGA GCACAAGAAG TTGACGCACC AATCAAATGG ATGACATCGA TCCAATTGGA 780 

AATCACAGAC TCAGATGGCA TCGTGACCCG TATTCGTTTA GCTAGATAGA TGGTATGTGA 84 0 

TGAAGGTAGA GCATCAATTG TA 862 
(2) INFORMATION FOR SEQ ID NO : 286: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 650 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY : linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 286: 

TCGTTTACAA GATCGCTAAA ATGCATCTCA TGATCGCGAC CACGAATTCC AAGATAGCAC 60 

GCGCTACCTC AATCATAGAT AGTTCACTTT TTTCTTGCCC AGCAAATACT TCTAATTCCA 120 

AAGCGTTTCT CCTCATTTAT ACTACTATCG CCAGAGCGAA CAGACTCTGA CCTCATTTTA 180 

TCATTTACTC TTTATTTTAC GATAATTTTG CGGAATAGTC AAAGGTTAAG GGGGAGAAAG 240 

TGGCAGGATT AGACTAATTC CAATATAAAA CTCATTCCTT TTTCTGTTGC TCCATTTTCC 300 

ACAAATCCAA GCGACTTGAA ACACCTCCTA GAAGCATGAT TGTAGGTGTA GATTTTCTTG 360 

ACTCTCAATT CTTTCCATCC TTTTACTCGA GCCAATTCAA TCAAAGCACT TAGAATCTTT 42 0 

TTTCCAAGTC CTCGATGTTG GTAAGCGGAA TTCCCAATCA CAATGGGGAG ATTATCCTGA 4 80 

GATAGTGTAA TATCCCCAAT TGGAAACCAT TCTCCCTTCT CCTTGACTTC AATCCAAAAA 540 

AGCTCACCAT GCCGATyCAr ATAGGAATAC ATGGCTTCCA AGGTCGcTtG ACTGTAAGGA 600 

AGCTTCACCC CATCTACGAG GtAAcCAAGT TCACATCCGT GATACCAAGC 650 
(2) INFORMATION FOR SEQ ID NO : 2 87: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 1119 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 287: 

GATAGCAATC CGCTTCAGAA ACTTCTCGCT TACCTCTAAC TCCGATCGCT AGTTTGGGAG 60 

AAGATACTTC CATTCTCATA CTATCTGTTG GCTTTGCAGG CTGTAAAAAC AACTTTTCTC 120 

TTGCTACTTC CTGAAAATCT GAATCTTGCA GTTCTTTGCT T T C AAAAT AG TCCTGTACTC 180 

GCTCCACATC AAAATTCCCA GCTAAAGACA GAGACATGTT TACAGGTTTG TAAAACTTTG 240 

TAAAATTTTC TTGCAAATTA GTTAGATTGA TTTGGGAAAT GGACTCCTCA CTTCCAACTA 300 

TATCAGTTGC TAAAGGTGTA CCAGGATACA AATTCGCTAA AGTTGAAAAG AATAAACACG 360 

AATCTGGATC ATCTTGGTAC ATTTCTCGTT CTTGCTGAAT AATATCCTGC TCTGTCAGAA 420 

TGGAAGCTTC AGTAAAGTGT GCTGATGTTA CCAATTCATC AAGTAAATCT AAATTTTCTA 480 

AAAAATAATC CGTTGCTGAA AAAAGATAGT TTGTTTTTGT AAAGCTTGTA AAGGCATTAC 540 

TATCTGCACC TAGACTCGTA AAAGCCGACA TCAAATCACT AGAATCTTCT CTCTCAAATA 600 

ATTTATGTTC AAGAAAATGA GCAATTCCTC CAGGATATTG TTTTACATCT CCGTCAACTT 660 

CTGTGACAAA CGTATCTACC GAACCAAACT GTACAGTGAC ACTCCCGTAA ACCTCTTTAA 720 

ATTCCTTTTT AGGCAAAAGA GCAACTGTCA ATCCGTTGGC CAAACGAGTT CGATAAACCA 780 

TTTCTTTTAC AGCTGGATAG TATTTTTCTT CAAAAACAAC CTTTGTCATT CTATTCCTTC 840 

CATAAAGTAA ATCGCTTGTA GTTTCACATT ATTAGCTACT CTACAAATAG CATCTTTGTC 900 

AATTTGTTCA AGCTTTGCAA TCCAACTTTT AAAGTCTGCT GAAGATTTTC CAAATAAGGC 960 

ATTTTGATAA GCACGTTCAA T CAATGAAG A ATGATTATCT TGAGAAAGTA ACAACGACCA 1020 

ACGAATCATT TCCTTGGTCT GATTTAACTC AAACTCTGTA AAAAAACCTT TTTTTAAATC 1080 

AAGCCGTTGA TTATTCATCA ATTTACGAGC CTGGTTACG 1119 
(2) INFORMATION FOR SEQ ID NO : 288: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 540 base pairs 

(B) TYPE : nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO; 288: 

ACGCCCTCGC GGGGACATGA CGAATTCCCC GTTCATCACG AAGGCCGCCG AGGAGTGGGG 60 

GGTGCCGTCC AAGTCAAAAG CGGCCCCACA TCGATTCAGT TCCCCGACGA ACAGCCCTTT 120 

CCCCCAGCGT TCCTGGCTTT GCAACCGTTT CACAACAGCC TCGTAAAGTA GGCCGGACAA 180 

GGCAGACGGA CTCCAAAGGA GTTCTTCCAT CTGCAAGTGC GCCTGCGTTA TGTGATCCCG 240 
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GTCTTTTGCA TGTGTGTGGC ATGAATGCTG TTCCCAATCC CACTCCAGAA CATTCTCCTC 3 00 

AAAAGTGCGC AACGTCGCCC TGAATGAATC CTGCCTTGTA GTCGTGACCA TTCCTATGAA 3 60 

GGGTCGCAGA GGATTTTCCC CGAGTGCAAG CGCATCCTCC GGCTCAAATC GGGTGCATTT 42 0 

CACAGTCCCG CTCAACGCTA GCCCGATCCC TTTTTGGCAT GGTGACTCAA GCGTCCTTTC 4 80 

AAACAAAAGC TCCTCATCCG CTCCAACCGG CCCGACGTAG ACGCGTAGAC CGAAGTCGTC 54 0 
(2) INFORMATION FOR SEQ ID NO: 289: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 1949 base pairs 

(B) TYPE : nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 289: 

AAAGAATTCG ACCAATTCAA GGTTGAGGCA TCGCAAACTA TGGACTGTTC CCCCGTCAGT 6 0 

TCTGGACAGA AAACGGGATA AGGTTGGCTG TGAAGCAAGC TGCCCTCCTA CCAACAATTT 12 0 

TGGAAAGTAG GCATCAGCTG ACAATTCTTT ACAAGCATAG TCCGTTCCAT AACCTGTTAA 180 

CAGTTGAAAG AGGAACTGGA CAAGGATATC TGAATCCGAA TAACGACAGT AGCGGCGTTG 24 0 

GTCATTCGTT ACTAAATACT TAGAAATCCG CTCTTTTAGT TTCAACTGGG AAAAAAGTTC 300 

CTGAAAAAAG ATAAGACCAC CATACTGGGT TAAATGACCT CCATCGAAAG ATAGTTGGTA 360 

AAAAGACTTG TTTTGGAAGT GATGATTTGG TAAACTGTTC ATGTGAGTTT CCTTTCTTTT 420 

TGTGTTTTTT TCTACACTTA TACCATAAAG GGGAAACTCT TTTTTGTCTA GTAAAAAACA 480 

CCCATTGGGT GAAAAAAGAA ACCATCCAGG ATCTAAGCTA AGGCAAGGAT TCTGGATGGT 54 0 

TTTTAGATTT GGGGTGAATA ATTGGGGTTT TACAATATCA ACTCCCATGA TAGTCATGAG 6 00 

ATGACTCTTC ACGAATTGAC GTGATGACTG TCCTTCCTTT TGCATAATTA CCTCCGAAAC 660 

ACAAAAAAAG GGGTAGACAA TCTAGTGTCT ACCCCCGAAA GTTTATTAAA ACAAAAATCC 72 0 

TGCCAAAGAA TTTTTGGCAG GAAACCAAAT CAATTTATCA GTTTCTATCA ATCGCTTATC 780 

GCTCTCAAAG ACTGGTAAAT AGGGATTCCG CAATCAAATT GCGATACTCT ATTATTTAAG 840 

AGTAACTGAA GCTCCAGCTT CTTCCAATTT AGCTTTGATT TCTTCAGCTT CTGCAGTTGC 900 

AACGCCTTCT TTAACAAGTG CTGGTGCACC GTCAACAAGT TCTTTAGCTT CTTTAAGACC 960 

AAGACCAGTG ATTTCACGTA CAACTTTGAT AACGCCAACT TTTTTGTCGC CTGCAGATGT 102 0 

CAATTCAACG TCGAATGAAT CTTTAGCAGC ACCAGCATCA GCTGCATCAG CTGCAGCAAC 1080 



WO 98/18931 



PCT/US97/19588 



1318 

AGCTACAGGA GCAGCTGCAG TTACACCAAA TTCTTCTTCG ATAGCTTTTA CAAGGTCGTT 1140 

CAATTCAAGG ATTGAAGCTT CTTTAATTTC AGCAATAATG TTTTCAATGT TCAATGCCAT 12 00 

TGTTATTTCC TCCAAATAAG TTTTAAATTT TATAATAGTT TTTTTCGTAG CTAGksTACG 12 SO 

CTGTGTAGCT TAAGATTAAG CCGCGTCTTC TTTGCTTTCT GCAACCGCTT TGACTGCAAG 13 20 

AGCAACGTTG CGCACTGGCG CTTGAAGTAC AGAAAGGAGC ATAGAAAGAA GTCCTTCGCG 13 80 

GTTTGGAAGA GTTGCAAGTG CAAGAATCTC TTCTTTAGAT GCGACAGCGC CTTCGATTGC 1440 

ACCACCTTTA ATTTCAAGTG CTTCAGCGTT TTTAGAAAAG TCGTTCAAGA TTTTCGCTGG 1500 

TGCGATAACA TCTTCATTAG AAAATGCTAC TGCAGATGGT CCAACAAATA CAGATGCAAG 1560 

ATCTTCAAGA CCAGCTTTTT CAGCTGCACG ACGCAAGATT GAGTTTTTAA TAACTTTATA 162 0 

CTCAACTTCG CTTCCACGAA GCTCACGACG AAGAACTGTA TCTTGCTCAA CTGTCAAACC 16 80 

ACGAGCGTCT ACAACGACGA TAGATGCAGC AGCTTTCATT TTTTCAGCTA tACGTCAACT 1740 

AGTTCCGCTT TTTTAGCAAT AATTGCTTCA CTCATTAGTG TGTTCACCTC CGTAATTATT 1800 

TTGCTTGGGG AATTTTTCAA AAAGAAAAAC GCGCCCAATC CTAGACACGA AAGTACAATA 1860 

CGCTTCTTTT TACATGATAC GTTTTGTCCT CGGTAGGATA TTTATGAGTC GAGCTCCCCT 192 0 

ACTGTCTTAG GCAGTTTTTT TAGATACGG 194 9 
(2) INFORMATION FOR SEQ ID NO: 290: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1023 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2 90: 

GGACTGTTTG ATCTTATACA GTAGCTGCTT GATCCAAGCT TTCACCGATA GCGGCTAGGC 60 

GCTCGATAAC TTCAGCTTGT GTCAATTCAT TTTTTGAAAC ATAGCGGTTA CGTGGGTGAA 12 0 

CACGGCACTC GTGTGAGCAT CCACGAAGGT ACTTGTCTTC ATTTTCTTCT GATGTCAAGA 180 

TACGACGGTT ACAGAATGGA TTTCCACAGT TGACATAACG TTCACATGGT GTTCCATCAA 240 

ACCAGTCTTT CCCTACGATA GTTGGGTTGA CATGGTTGAC ATCAACGGCA ATACGCTCGT 300 

CAAAGACGTA CATTTTCCCA TCCCAAAGCT CACCTTGAAC TTCTGGGTCT TTACCGTAAG 3 60 

TTGCGATTCC TCCGTGCAAT TGGCCGACAT CTTTGTAGCC TTCACGGACC ATCCAGCCTG 420 

AGAATTTCTC ACAGCGAACG CCACCTGTAC AGTAAACCAC GACACGCTTG TCCATGAATT 4 80 

TTTCCTTGTT ATCACGGACC CATTGTGGTA ACTCACGGAA GTTGCGAATA TCTGGGCGAA 540 
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TAGCTCCACG GAAATGTCCT AGGTCGTACT CATAATCGTT ACGTGTGTCA AGGACAACGG 600 

TATCTTTATC AAGAAGCGCT TCTTTGAACT CTTTTGGAGA CAAGTAAGCA CCTGTTGTTT 660 

CAAGTGGGTT GATGTCATTG TCAAAGTCGT TGTCTTCCAA ACCAAGGTGG ACAATTTCTT 72 0 

TCTTGTAGCG AACAAACATC TTCTTGAAGG CTTGTTCATT TTCTTCGTCA ATCTTGAACC 7 80 

AGAGTTCTTC CATTCCTGGA AGGCTGTGAA CGTAgTCCAT GTATTTTTGA GTTGTTTCAT 840 

AGTCACCTGA AACTGTTCCG TTAATTCCCT CGTCAGCGAC TAGGATACGG CCTTTAAGGn 9 00 

CGATTGATTT ACAGAAAGCC AAGTGGTCTG CAGCAAATTG CTCTGCATTT TCAATTGGAG 960 

TATAAAGGTA GTAAAGTAAG ACACGAATAT CTTTTGkCaw AAGATTTGTA TCTCTTTATC 102 0 

TAT 1023 
(2) INFORMATION FOR SEQ ID NO: 291: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3831 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 291: 

ACTATGAACA AGACCCAGAA AAAGTAGCCT TATTTCTTAA GAATTTTAAT AGTTTAAAGC 60 

ACCTAGCACC TGTTTAGATT GACGAAACAG GATTCGATAC TTATTTTTAT CGAGAATATG 120 

GTCGCTCATT AAAAGGTCAA TTAATAAGAG GCAAAGTATC TGGAAGAAGA TATCAGAGGA 180 

TTTCTTTGGT TGCAGGTCTA ACAAATGGTG AATTAATCGC TCCAATGACT TACGAAGAGA 240 

CGATGACGAG CGACTTTTTT GAAGCTTGGT TTCAGAATTT TCTCTTACCA ACATTAAACA 300 

CACCATCGGT TATTATTATG GATAATGTAA GATTCCATAG AATGGGGAAG CTAGAACTTT 3 60 

TATGCGAAGA GTTTGGGCAT AAACTTTTAC CTCTTCCTCC CTACTCGCCT GAGTACAATC 42 0 

CTATTGAGAA AACATGGGCT CATATCAAAA AGCACCTCAA AAAGGTATTA CCAAGTTGCA 480 

ATACCTTTTA CGAGGCTTTT TTATCCTGCT CTTGTTTCAA TTGACTATAT TAGAGGCGAG 54 0 

ACATTTTTCG GTTCTTTGTC AACTGTAGTG GGTTGAAGAA AGCGAAGATC TAGAAAGGAC 600 

AAATTTCGTC CTTTCTTTTT TGAAGTTTTC AAAGTTCCTA AAACCAAAGG CATTGTGCTT 660 

GATAAGTTTG ATGAGATTAT TGGTGGCTTC CAGTTTGGCG TTGGAATAAG GTAATTGAAG 720 

GGCGTTGACG ATTTTCTCTT TATCTTTGAG GAAGGTTTTA AACAAAGTCT GAAACAGAGG 78 0 

TGGAAAAGCA AGAGCTGATA GAGATTATAG TGGTGTTTAA AGTCTTCGGA ATAGCTCAAA 840 
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AGTTTATCTA GAATTTCTTT ATTAGTCAAG TGCATACGAA AAGTAGGGCG ATAAAATCGT 900 

TTATCACTCA GTTTCTGACT ATCTTGTTGA ATGAGCTTCC AGTAGCGCTT GATAGCCTTG 9 60 

TATTCATGGG ATTTCGGATG ATGGCTTGTG TTCTGCTCTC AAGAACAGTT ATGATATTGA 1020 

GTTTATCAAA GTCCTGAGCA ATAAAGCTCA TCTCCATCTC CCGATTGAAA CAGTCACTCC 10 80 

CCGGACTGTT TCAACsTCCT AGGACATAAT CTCAGGAAGA CGCGAAAAAT CATGCTCAAA 114 0 

GTGAAAATCA TTGTTCTTGC GAATGACAGT TGAAGTTGAA ATAGACAACT GATGATCAAT 1200 

GTCGGTCATA GAAGTCTTTT TAATTAGCTT CTGAGCAATC TTTTGGTTGA TGATACAAGG 12 6 0 

AATTTGATGA TTCTTCTTGA CGATAGAAGT CTCAGCGAGC TCCATTTTTG AGCAATGATA 132 0 

GCACTTAAAA CGGCCTTTTC TAAGAAGAAT TCTAGTTTGA ATTTTTTTAT ACTAGAAAAT 13 80 

CAGAACCATA ATACCTATAT AAAAATATTA TAGTTCTAAT AGGATTTACC CAAAAGTTTT 144 0 

AAGGCGGTCT TTTTAGAACT TTAATTGTTT GAAATTTAGG TAGCAAATTT GTTTCTATTT 15 00 

TGTCAACTTT TCCTATTTTT ATCTTGTTGA GGCTGGTATT TTAACAATTC AGGAATTGAT 15 60 

AGTGAATGTG TAAAATTTTT TGTTAGAATA AGTTTATAAA AAAGAAAAGG AGTATTTGAT 162 0 

TATGTTACAA AAAATTTATG AGCAGATGGC TAATTTCTAT GATAGTATTG AAGAAGAGTA 168 0 

TGGTCCTACA TTTGGTGATA ATTTTGACTG GGAACATGTT CATTTTAAAT TTTTAATTTA 1740 

TTATTTACTG AGATATGGCA TTGGTTGTCG TAAGGATTTT ATTGTTTACC ATTATCGTGT 1800 

TGCTTATCGT TTGTATCTTG AAAAATTGGT AATGAATCGG GGTTTTATTT CTTGTTGAGG 18 60 

TAATTTTAGT AAATTTCCGA ACTAATTTAC TCTTTTATGG AAAGATGATA GTAAATAGCT 1920 

AGTAATTTTT CTAAATCATT TTTTAATAGT TGGAAATAGC AAATCTTTCT ATTGTTTCTT 1980 

CTTGATAAAA AGGCGATTTT TTATTATAAT AAATTGTAAG ATATAATTGC AGGTGAGAGT 2040 

CCTGCCATGT ATGTGAGAAA GGAAGAGCCT GATGGCTCAG ACAAGATTAT GACTTCAGTT 2100 

GTTGTTGTAG GTACCCAATG GGGTGATGAA GGTAAAGGGA AGATTACAGA CTTCCTTTCA 2160 

GCGAATGCAG AAGTGATTGC ACGTTACCAA GGTGGTGATA ATGCTGGTCA CACGATTGTG 22 2 0 

ATTGACGGTA AGAAATTTAA GTTGCACTTG ATTCCATCTG GGATTTTCTT CCCTGAAAAA 22 80 

ATATCTGTCA TTGGGAATGG TATGGTTGTA AATCCTAAAT CTCTTGTAAA AGAGTTGAGC 2340 

TATCTTCATG AGGAAGGTGT AACAACTGAT AACTTGCGTA TTTCTGATCG TGCGCATGTT 2400 

ATTTTGCCTT ATCATATCGA GTTGGATCGC TTGCAAGAAG AAGCTAAGGG CGACAATAAG 24 60 

ATTGGTACGA CAATTAAGGG AATTGGTCCA GCTTATATGG ACAAGGCTGC TCGTGTTGGA 252 0 

ATTCGTATTG CAGATCTTTT AGATAAAGAT ATTTTCCGTG AGCGTTTAGA ACGTAACCTT 2580 

GCTGAAAAGA ATCGTCTTTT TGAAAAATTG TATGACAGTA AAGCGATTGT TTTCGATGAT 2640 
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ATTTTTGAAG AATATTACGA ATATGGTCAA CAAATCAAGA AATACGTGAT AGATACATCT 2 7 00 

GTTATCTTGA ATGATGCGCT TGATAATGGC AAACGTGTGC TTTTTGAAGG TGCACAAGGT 2760 

GTTATGCTAG ATATCGACCA AGGTACTTAT CCATTTGTTA CGTCATCAAA CCCTGTAGCT 2 82 0 

GGTGGTGTGA CAATTGGTTC TGGTGTCGGT CCAAGCAAGA TTGACAAGGT TGTAGGTGTA 2 880 

TGTAAAGCTT ATACGAGTCG TGTAGGAGAT GGTCCTTTCC CAACTGAGTT GTTTGATGAA 2 94 0 

GTGGGAGAAC GTATCCGTGA AGTGGGTCAT GAATATGGTA CAACAACTGG TCGTCCACGT 3 0 00 

CGTGTAGGTT GGTTTGACTC AGTTGTGATG CGTCATAGCC GTCGTGTTTC TGGTATTACT 3060 

AACCTTTCTT TGAACTCTAT TGATGTTTTG AGCGGTTTGG ATACTGTGAA AATCTGTGTG 312 0 

GCCTATGATC TTGACGGTCA ACGTATTGAC TACTATCCAG CTAGTCTTGA ACAATTGAAA 3180 

CGTTGCAAGC CTATCTATGA AGAGTTGCCA GGTTGGTCAG AAGATATTAC CGGAGTTCGC 32 4 0 

AATTTGGAAG ATCTTCCTGA GAATGCGCGT AACTATGTTC GTCGTGTGAG TGAATTGGTT 33 00 

GGCGTTCGTA TTTCTACTTT CTCAGTAGGT CCTGGTCGTG AACAAACAAA TATTTTAGAA 33 60 

AGTGTTTGGT CCTAAGAGAT TTTTAAGATT TGTTTAAGAT AGGTCGGGTA TACTATAGAC 342 0 

GGTTACAAGA AGACCTCCTA ACTTGTTGTA ACAAATATCC TAAACTTTTC TTTTTCATAA 3480 

TAATCTCCCT ATAGAGTCAC CGCATTCGGT GGCTTTTTTT GTGTTGGGAT TCATGATATA 354 0 

ATAATAAAAT CGATAAGTAG GAAAAGAGAA AAGAGATGTA TTATACGCTT GAAGAAAAAG 3 600 

AAGTCTTTAT GAGGGAGGCT TTGAGAGAGG CTGAGATTGC TCTTGAACAC GATGAAATTC 3 660 

CAATTGGTTG TGTGATTGTC AAAGATGGGG AAATCATTGG TCGTGGGCAT AATGCGCGTG 372 0 

AGGAATTACA GCGAGCGGTT ATGCATGCGG AAATTATGGC TATAGAGGAT GCGAACTTGA 37 8 0 

GTGAGGAGAG TGCGCTTGCT GGATTGCACA CTTTTTGTGA CCATTGAACC G 3 831 

(2) INFORMATION FOR SEQ ID NO: 292: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 1441 base pairs 
(S) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 292: 

CCGCTGTTCC AACCGCAACA TACCATAGTC CGTACGGGAT TCGAACCCGT GTTACCGCCG 60 

TGAAAAGGCG GATGACTTAA CCCCTTGACC AACGGACCTG AGTTGTTATT TTCAACTCTT 12 0 

ACTATTATAC AGTCTTTTCA AACTTTGTCA ACTACTTTTT CTAATTTTTG TTTATTTTTT 180 
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CAACTTATAG TAAAAAAAGC CAGAATTATA CTGACTCTTC TATCGCTCAT TAAACTTAGA 240 

AGCACGTTCT TTTCCCCACC AATAAGGGAT TAGTTCTGCG ACTTTAACTG TTTTTCTTAT 3 00 

ATTATAGTCC ATCATGAATT CTGCATCTTT ATTTTCAGCA TTAAGCTCTA AAAGGAATTC 3 60 

TCTACAAGCA CCGCAAGGCA TGGCTGAACT TCCACCATAA GGTGGTTTGT CTCGAAAGGC 42 0 

TAATACTTTC TTAACCTTAG TTTGTCCTGA AAATTGGTAC ATATTGAAGA GGGCCGCCCG 480 

TTCTGCGCAG AGATGGAAAA CACCACAGGT TCCCTCCATA CAGAATCCTG TAAATATTTG 540 

TCCATCTCCT GCTTCTACTG CAGCTACAAC ATGATTGGCA TAAACAAAGT CTGATACTTC 6 00 

ATGTGGATTG TATAGTTTCT GTGCTTCTTC GTACATCTTT TCCCAGATGT CCATTATTGT 660 

ATCCTCTTTA TTTAGAGATT TCTTTTAGCA TGTTTTCGAT ATGCTGAATT GATTTTTCAC 720 

GTCCAAGCAA GAAAATTGTA TCTGGTAATT CTGGCCCATG CATTTCGCCT GAAACTGCGA 780 

TACGAATAGG CATGAAAAGA TTTTTCCCTT TAATACCTGT TTCTTTTTGG ACTGCTTTAA 84 0 

TTTGTGGGAA GATATTTTCT GTCACAAATT CATCATCTGT CATCGCTTCA AGTTTTGCTT 9 00 

TGAATGCTTC AAGAACTGTT GGAACTGTTT CACCCGTCAT GACTTCGCGC TCTGCTTCTG 960 

TCAATTCTGG GAAATCTGAG AAGAAAAGAT CTGTCAATGG GATAATCTCA TCTACTGATT 102 0 

TCATTTGTGG TTTATAGAGC TCAACTAATT TTTCAGCCTT GTCAGTCAAA CGGCCTGCTT 108 0 

CCTCTAAGAA TGGTTTTGCC ATTTCAAAGA TGGTTTCAAG GTCTGCATTC TTGATATAAT 1140 

CATTGCTCAT CCAGTCTAGT TTTTTCTGAT CAAAGGCTGC TGGTGACTTG CTGAGGCGGT 1200 

TTTCATCAAA AAGTTTAATG AATTCTTCAC GAGAGAAAAT CTCATCCCCA CCACCTGGGT 1260 

TCCAACCAAG AAGAGCAATA AAGTTAAAGA CTGCTTCTGG AAGGTAACCT TTCTTTCGGT 132 0 

AATCTTCGAT AAATTGAAGT GTATTAGTAT CACGTTTAGA TAACTTCTTA CCAGTTTCAG 13 80 

AGTTGATAAT CAAGTGTCAT GTGACCGAAC TCTGGACCTT CCTCAACCTA AGAGCGGGTA 144 0 

T 1441 
(2) INFORMATION FOR SEQ ID NO: 293: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 4398 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY : linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 293: 

CGGCTTATGT AGTGGCAATC TTTCTACGTA AGCGAAACGA GGGGAGATTA GAGGCGCTAG 60 

AAGAAAAAAA AGAAGAACTA TACAATCTTC CAGTAAATGA TGAAGTAGAA GCTGTAAAAA 120 
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ATATGCACTT GATTGGACAA AGTCAAGTGG CTTTCCGTGA ATGGAATCAA AAATGGGTCG 180 

ATTTATCTCT CAACTCTTTT G C CG AT AT TG AAAATAATCT CTTTGAAGCA GAAGGCTATA 240 

ACCATTCATT TCGTTTTCTC AAGGCCAGTC ATCAAATTGA CCAAATTGAG AGTCAAATTA 300 

CTTTGATTGA AGAAGATATT GCGGCAATTC GCAATGCTTT GGCAGACTTA GAGAAGCAAG 3 60 

AATCTAAAAA TAGTGGTCGT GTTCTTCATG CTTTGGATTT ATTTGAGGAA CTTCAGCATA 420 

GAGTTGCTGA AAATTCAGAA CAGTATGGTC AAGCCTTGGA TGAAATTGAA AAACAATTAG 4 80 

AAAATATCCA ATCTGAATTT TCACAATTTG TAACCTTGAA TTCATCGGGT GACCCTGTGG 540 

AAGCCGCAGT GATTTTGGAT AATACAGAAA ATCACATTTT GGCCTTAAGT CATATTGTGG 600 

ATCGTGTTCC AGCCTTGGTT ACGACGCTTT CTACAGAATT GCCAGATCAA TTACAGGATT 660 

TGGAAGCCGG TTATCGTAAA CTAATTGATG CTAATTATCA TTTTGTTGAA ACGGATATTG 72 0 

AAGCGCGTTT CCACTTGCTT TATGAAGCAT TCAAGAAAAA CCAAGAGAAT ATTCGTCAGT 7 80 

TGGAATTGGA TAATGCCGAA TATGAGAATG GACAGGCACA AGAGGAAATC AATGCCTTGT 840 

ATGATATTTT TACTCGAGAA ATTGCTGCTC AGAAAGTAGT GGAAAATCTA CTTGCAACTC 9 00 

TTCCAACTTA TCTTCAACAT ATGAAAGAGA ATAATACTTT ATTGGGAGAA GATATTGCAC 9 60 

GTTTGAACAA GACCTATTTA CTTCCTGAGA CAGCTGCAAG CCATGTTCGT CGTATTCAGA 1020 

CAGAATTAGA GAGTTTTGAG GCAGCTATTG TTGAGGTAAC TTCAAATCAA GAAGAACCAA 1080 

CCCAAGCTTA TTCAGTTCTT GAAGAAAATC TTGAGGATTT ACAAACTCAA CTAAAAGATA 1140 

TTGAAGATGA GCAAATTTCA GTTAGTGAGC GCCTGACACA AATTGAGAAA GATGATATTA 12 00 

ATGCACGTCA AAAGGCCAAT GTTTATGTCA ATCGTCTCCA TACTATCAAG CGATACATGG 12 60 

AAAAACGCAA TCTGCCAGGT ATTCCACAAA CTTTCTTGAA GTTATTCTTT ACGGCAAGCA 13 2 0 

ATAATACCGA GGATTTAATG GTTGAGTTAG AACAAAAAAT GATTAACATT GAATCTGTTA 13 80 

CCCGAGTTCT TGAAATTGCA ACGAATGATA TGGAAGCTTT AGAAACGGAA ACTTATAATA 144 0 

TTGTACAATA TGCAACTTTG ACAGAGCAAC TCTTGCAATA TTCTAACCGC TATCGCTCAT 1500 

TTGATGAACG CATTCAAGAA GCATTTAACG AaGCTTTAGA TATTTTTGAA AAAGAATTTG 15 60 

ATTATCACGC TTCATTTGAC AAGATTTCTC AAGCATTGGA AGTGGCAGAG CCTGGTGTAA 1620 

CCAATCGCTT TGTTACCTCA TATGAGAAAA CACGTGAAAC GATTCGTTTT TAATAAAAGA 1680 

AAAAGATTTT ATTGTGTGAG GAGCAGAATC AAATCTTTTT CTATAGTTGT GGGGAGATTT 1740 

ACTTCATTTT CTCCTGAGAT TGAGTTTTTG CCCAGCCGAT TTATCCACTA CCTCAAAACA 18 00 

GTGTTTTATA CTCTTCGAAA ATCTTTTCAA ATCACGTCAG CGTCGCCTTA CCGTACTCAA 1860 
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GTACAGCCTG AGGCTAGCTT CTTAGTTTGC TTTTTGATTT TCATTTAGTA TTAAAGTGAT 1920 

TTCGCCAGTC TTATCTGCAG CTTCAAATCT GTACTTTGAG TAACTTGGTA ACCGTCCAAT 1980 

AACGAAGTCT ATTGAAAAAT CTCCAGACTA GAGAACTCAC GGATAGTTCC TAATCTGGAG 2 040 

ATTTCTTATT TGCACTTTTC TTGTACAACT TTAGTCCACG GTAAATAGAC CTCTAAAACC 2100 

TCTTTGTTTA CGAGAGTTTC CTCGTTTGGA AGACATTCTA GAAGATAGGA TAGATATTTC 2160 

TCGCTATTTA TACTAGACTA AAATCAAAAA GCATTATATA ATAGTGATAT GAAATCAACT 2 2 20 

AAAGAAGAAA TCCAAACCAT CAAAACACTT TTAAAAGACT CTCGTACAGC TAAATATCAT 22 80 

AAACGCCTTC AAATCGTTCT ATAGTAAAAT GAAATAAGAA CAGTACAAAT CGATCAGGAC 2340 

AGTCAAATTG ATTTCTAACA ATGTTTTAGA AGTAGAGGTG TACTATTCTA GTTTCAATCT 2400 

ATTATATTTC GTCTGATGGG CAAATCTTAT AAAGAGATTA TAGAACTTTT ATAGTAGATT 24 50 

GAAATAAGAT GTGAACAACT CTATCAGGAA AGTCAAATTA ATTTATAGAA ATATTTTAGC 2 520 

AGCCAAGGTG TACTGTTATA GATTCAATAC ACTATAGACT GTAATCAAAC AACGATTTGG 2 580 

CGAAATGTAA AAAAATATGA GGAGTTCGGA CTCGACTCTC TCCTTCAAGA AACACGTGGT 2 64 0 

GGTCGTAACC ATGCATATAT GACAGTTGAG GAAAAGAAAG TCTTTCTTGC CCGCCATTTG 2 700 

AAGGCTGCAG AGGCAGGAGA ATTTGTTACA ATTGATGCCT TATTTCAGGC TTATAAAAAG 2 7 60 

GAGTTAGGTC GTTCCTACAC ACGTGATGCC TTCTATCAAC TGTTGAAGTG CCATGGTTGG 2 82 0 

CGAAATATTA TGCCACGTCC AGAACATCCT AAGAAAGCAG ACGCTCAAAC CATTGTCGCG 2 880 

TCTAAAAATA AAATCTCAAT TCAAGAAGAA AAGAAAGCGC TTTAAAACCA GTAGACGTTT 2940 

TCGTAAGGTT CGCTTGATGT ACCAAGATGA GGCTGGTTTC GGTAGAATCA GTAAACTGGG 3000 

ATCTTGTTGG GCTCCAATAG GAGTAGGTCC ACATATCCAT AGTCAOTATA TACGAGAATT 3 060 

TCGCTATTGT TATGGAGCTG TTGATGCCCA TACAGGCGAA TCATTTTTCT TAATAGCTGG 312 0 

TAGATGTAAT ACTGAGTGGA TGAACGCCTT TTTAGAAGAG CTTTCACAAG CTTATCCAGA 3180 

TGATTATCTT TTACTCGTTA TGGACAATGC TATATGGCAT AAATCAAGTA CCTTAAAGAT 324 0 

TCCGACTAAT ATTGGTTTTA CCTTTATTCC TCCATACACA CCAGAGATGA ACCCCATTGA 3300 

ACAAGTGTGG AAAGAGATTC GTAAACGTGG ATTTAAGAAT AAAGCCTTTC AAACTTTGGA 33 60 

AGATGTCATG AATCAACTCC AAGATGTTAT ACAAGGATTG GAGAAGGAGG TGATAAAGTC 342 0 

CATCGTTAAT CGGAGATGGA CTAGAATGCT TTTTGAAAAC AGATGAGTAT AAAAAGAAAG 3480 

TCCTCATTTC AATAGAAATC ACGACTTTCT GATGGATTTA TAGTAAAATG AAATAAGAAC 354 0 

AGGACAAATC GATCAGGACA GTCAAATCGA TTTCTAACAA TGTTTTAGAA GCAGAGGTGT 3600 

ACTATTCTAG TTTCAATCTA CTATATTTTT GGAGTGATAG AAAAGCCCTT CATAAGCTAG 36 6 0 
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TCTACTTGTT CAGGTGCGAG AGCTTTGACA TCTTTTTCTG TACTTAGCCA AGTCAGTTTT 3 720 

CCGTTCTCAA AGCGTTTATA TAGTAGCCAA AATCCTTGAC CATCCCAGTA AAGGGCTTTA 3 780 

AAGCGGTCTT TACGTCCACC ACAAAAGAGA AAGACTTGAC CGGAGAAAGA ATCCAATTCA 3 840 

AAGTGGGTTT TAACTACATA GGCTAATGAG TCTATTCCCT GCCTCATATC TGTCTTGCCA 3 900 

CAAACAAGGT GAACTTGACC TAAATCACTT AGTTGAATTA TCATAGTACA ATACCTTTCC 3 960 

TCCGATAATT ATTTTTTATC TAGTATACTG GAAGTTGGGG AATTAGGATA GATACCTTGT 4020 

TATGACGCGC TTACGTAACT TGTAACTAGC TGCCTAGTTT GATCTTTGCT TCTTCATTGA 4 080 

TTAGCAGTAG ATTTCAAAAT GATAAAAACG CATAGTATCA GGTATTGAAA TGTACTGCCC 4140 

CAAAAGTTAG ACAGAAAAAA TCTAACTTTT GGGGTGTTTT TGTTATGAAA TTAAGTTATG 4200 

ATGATAAAGT TCAGATCTAT GAACTTAGAA AACAAGGATA TAGCTTAGAG AAGCTTTCAA 4260 

ATAAATTTGG GATAAATAAT TCTAATCTTA GGTATATGAT TAAATTGATT GATCGTTACG 4 320 

GAATAGAGTT CGTCAAAAAA GGAAAAAATC GTTACTATTT TCCTGATTTA AAACAAGAAA 4 3 80 

TGATTAATAA AGTCTTAC 4 3 98 
(2) INFORMATION FOR SEQ ID NO : 2 94: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 718 base pairs 

(B) TYPE: nucleic acid 
(CI STRANDEDNESS : double 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 294: 

AGATTTTTAG ACTTTGTCTT TAATCGTTTC TTTTTAGGGA TGATTGCGAC ACCTTCTTTT 60 

GGCTATTAAC TTTAGCAGGA GGGATTATCC TTGGTCTAGC GCCGGCTAGT GCCACCTTGA 12 0 

TGAGCTTATA TGCAGAACAT GGTTATAGCT TTCGGGAATA CAGTTTGAAG GAGGCTTGGT 180 

CTCTTTACAA GCAAAATTTT GTCTCAAGCA ACCTGATTTT CTATAGCTTT TTAGGTGTGG 240 

GTCTAGTTTT GACCTATGGT TTGTATCTCT TGGTGCAATT GCCTCATCAG ACCATTGTTC 300 

ATTTGATTGC GACCCTTTTG AATGTCCTAG TAGTTGCCCT GATCTTTTTG GCTTATACAG 3 60 

TATCTTTAAA ATTACAAGTT TATTTTGCCT TGTCCTATCG AAATAGTCTC AAATTATCCT 42 0 

TGATTGGCAT CTTTATGAGT CTAGCAGCTG TGGCTAAGGT TCTCCTTGGG ACTGTGCTAC 480 

TTGTAGCAAT TGGTTATTAT ATGCCTGCCC TGCTATTTTT TGTAGGAATT GGGATGTGGC 540 

ATTTCTTTAT CAGTGATATG TTGGAACCTG TCTATGAAAT CATCCATGAA AAATTGGCGT 600 
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CAAAATAGAA TGAAGCAGTT TTGGCTACAT ACGCTTCTAA GAACCTATAG TTCAGTGATG 660 
ATCATTATCA TTGCGAGTTT TGCAATCTTA CTCTCTTACG CTGTCTGGGA TTCACGTG 718 
(2) INFORMATION FOR SEQ ID NO: 295: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 718 base pairs 

(B) TYPE : nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 295: 

TCGGTACCAA AATTCTGGAT TTATACTAGC AAAGATCCAA GAGCAAATTA TTTAACAGAT 60 

TTAGGTCTAG TTTTCCCTGA ATCATTAAAA GAATTTGAGA GTGAAGATAG TTTTGCAAAG 12 0 

GAAATTTCTG CAGAAGAAGC AAATAAGATA AATGATGCTG ATGTAATCAT AACTTATGGT 180 

GATGATAAAA CTCTTGAAGC TTTACAAAAA GATCCTCTTT TAGGTAAAAT AAATGCAATT 2 40 

AAAAATGGTG CCGTTGCTGT AATTCCAGAT AATACACCGT TAGCAGCCTC ATGCACTCCA 3 00 

ACACCACTTT CAATAAACTA TACTATTGAA GAATACCTAA ATCTTTTAGG AAATGCATGC 3 60 

AAAAATGCGA AATAAAAAAC AAATAAACCT AGGCATAATT TTTATAATCT GCCTAGGTCT 42 0 

TCTTATTACA ATATTTTTGT CATTAAAGCT TGGAACAAAA GAAATTAATA TCAGAGATTT 4 80 

TTTAGCAGCT TTTGGAATGG GTAATACAAA TGATGATTTT ATTAAATCAA TTATATATAA 540 

rAGAATACCT AGAACTATTT TTGCAATTTT AGCAGGTTCT AGTCTTGCCA TAAGCGGTGT 600 

ATTGATGCAA TCAGTTACTA GAAACCCAAT AGCTGATCCA GGTATACTCG GTATAAACAC 660 

AGGAGCAAGT CTTAGTGTAG TAATTGGTCC TTCtTTTTAG GGAATTCATC AAGCATAA 718 



(2) INFORMATION FOR SEQ ID NO: 296: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1436 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 296: 

GAACTAATCA TTTTTACAGG ATGAGATTTA CAGCAGAGAG TTTGAAGGCT TTATCAAAGG 60 

TTTTTCTTGG CATAATGACT TTTCCTCGTT TCCACTTAAT TTTGTGTCTA CTTTATTATA 120 

CCAAGTCCAC s CTTAAGTTA GATAATAAAT CTAACTTAAG GAAGCTAGAA GGATGAGAAT 180 

CCAGGTGGTC AAGAGTCCCA AACTTAAGCT GATGGGGACA CCCAGAATAA TTTGCTTTTT 240 
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GAAGGCAAGG CCACGTTCCT CTATATTGGG AAGTGAGAGT TGAATGAGAG AACCAGCTGA 3 0D 

TGAAAAGGGT GAGATATTAG TAGATAGAGC GCCAATAACG GTGGCTGTTG TGAGTAAGTG 3 60 

AATATCAATC TGAGGATTTT GAGCACTGAT GATAGCAATG ATGGGAAAGA GGGCTGGAGC 42 0 

TACAACGGAT AGGGTGGAAC TAAAGAGTGA CATCACTCCG GCTATCACAC AAAAGAACAG 480 

AGGTAACCAG AAATGAGGAA TGGTTGTTGT CATGAGGTGC CCTATCAGTG TGACTAAACC 54 0 

TGACTTGACC GCTAGAGACA TTAGTAAGCT CATGCCGCAG AGCATGATAA TTGTAGCCCA 6 00 

GGGAACCTTA GCTAAAATGG CTTCTTGCTT CCCTAATTTG AGCCTTAAGG CGAGGCAGAC 660 

CATGAGTATT G AG AC AAAG C CAATATCAAA TGTTTTTTGA TAAGTAGCTA TCCAGGCGAT 72 0 

GTTTGGGAAA ATGAGATGCA ACAAGGGAAA AAGCCAAACC AAAACCATGC TGCTGATCAT 7 80 

GAGCAAGGTG GTTTGTCTTT GAACCTTGCT GAGGAGTGGT GGTTGGTCAA TAGTCAAGGA 840 

TGAGTTTGTT CTTCCCTTAC TATAGTGACT GTAACAGGAT AATAAAAGCA AGACGATGAG 900 

TGGGTAGATA ATGCTGACGA TAAAGATATG ATTGCCAAGT GAAAAAGCTT GCTCTTCCCA 9 60 

TCCCATTTGC TTAAACAGGC CTTGAAAGAC AATGCCTGAG CTACTGGTTA TCAAATTAGC 102 0 

CCCTCCTGAA GCTCCCCAAT TGACGGCTTG AGCTCCAATC AAAGGGTGTT TGTCCGCTTT 1080 

TTGACAGAGG GTAATCGCTA GAGGACAGCA AACGGCCATA GTAGTGAAAA ATCCAGCACC 114 0 

TAAAGCAGAC AAAAGGGTTG CCATCAGGTA TAAAATCATG TAGAGGGCGT TAGGGTGGGT 1200 

GCGTGTGCGG TAGAGAATGT GTTGAGCCAA AACATCAAGA GTACCGTTAG TTGTTGCAAC 1260 

GTTATAAAAG AGAGAGACGC TAAAAATGGT AAAAAAGAGT GAGGTTGGCC AAAAATGAAG 1320 

AAGTTCTTTG GGGCTTAATC CCATGAGAGT GGTTGCGATG AGGTAAGAAA AAGCAATAGC 1380 

CAGCAGGCCA ATATTGATTT TGGTGCGGTA ACCAATTCCA ATGGCTAGAG CAATGG 143 6 
(2) INFORMATION FOR SEQ ID NO: 2 97: 

(i| SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 1696 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 297: 

CCATTTGGGA AAGAACGTAA GAGTTTGCAG GGTGAGATTC CAGAAGAATT TTCAATGTCA 60 

GCCGTTGACA TGTCTATGAT TGACCACATT CCAGATATGA TTGAAAATGG TGTGGACAGT 12 0 

CTAAAAATCG AAGGACGTAT GAAGTCTATT CACTACGTAT CAACAGTAAC CAACTGCTAC 180 
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AAGGCGGCTG TGGATGCCTA TCTTGAAAGT CCTGAAAAGT TTGAAGCTAT CAAACAAGAC 24 0 

TTGGTGGACG AGATGTGGAA GGTTGCCCAA CGTGAACTGG CTACAGGATT TTACTATGGT 300 

ACACCATCTG AAAATGAGCA GTTGTTTGGT GCTCGCCGTA AAATTCCTGA GTACAAGTTT 3 60 

GTCGCTGAAG TGGTTTCTTA TGATGATGCG GCACAAACAG CAACAATTCG TCAACGAAAT 42 0 

GTCATTAACG AAGGGGACCA AGTTGAGTTT TATGGTCCAG GTTTCCGTCA TTTTGAAACC 480 

TATATTGAAG ATTTGCATGA TGCCAAAGGC AATAAAATCG ACCGCGCTCC AAATCCAATG 540 

GAACTATTGA CTATTAAGGT GCCTCAACCC GTTCAATCAG GAGATATGGT TCGTGCATTA 600 

AAAGAAGGAC TCATCAATCT TTATAAGGAA GATGGAACCA GCGTCACAGT TCGAGCTTAA 660 

GAAAGGAAAA GGAAATGATA GAGGCACAGG GTTTCTTAGT GGATAAGCAA ACAAGATGCA 720 

TTCATTACCA TAGCAAGCTG GATATTATTG CTTTACAATG CTATGATTGT AAAAAGTATT 780 

ATGCTTGTTA TCGGTGTCAT GATTCATTAG AACATCACCC TTTTGAGCCG TATCCCTTAT 840 

CTTTGATACA GGATAAGCCT ATTTTATGTG GTGTTTGTCT AAAACTACTA ACATATAAGC 900 

AATATAAAGA AAGCTTAAGT TGCCCCTTTT GTTTTTCTCG CTTTAATCCA GGTTGCCAAA 9 60 

ATCATAAGGA ACGCTATTTT AAATAGCAAA TCATCTAGTT TTGAAGTAGG AGAAAACTCA 102 0 

ATTTCAAGAG AAAATGAAGT AAATCTTCCC ACAATAAAAC GCATAATATC AAGATTGTTC 1080 

AATACCTGAT ACTATGCGTT TTTAAGATTT TAAAGACTTT TTTCCTTTAT CTGGTATTTT 1140 

GACTACTTGT TAAAACTGGG TTAATTTTCG ACTGTTTAAT AGTTATTATG CAAAGTCTAA 12 00 

AAGGTTAGAA TTGTCAAAAC AATCCGTCTA GAGTATGCGT GATGCCAACC GTGGTGGATG 12 60 

TTCTCAGTCA TGCCGTTGGA AGTACGACCT TTACGATATG CCATTTGGGA AAGAACGTAA 1320 

GAGTTTGCAG GGTGAGATTC CAGAAGAATT TTCAATGTCA GCCGTTGATA TGTCTATGAT 13 80 

TGACCATATC TCAGATATGA TTGAAAATGG TGTGGACAGT CTAAAAATCG AAGGACGTAT 144 0 

GGAGTCTATT CACTATGTAT CAACAGTAAC CAACTGCTAC AAGGCGGCTG TGGATGCCTA 1500 

TCTTGAAAGT CCTGAAAAGT TTGAAGCTAT CAAACAAGAC TTGGTGGACG AGATGTGGAA 1560 

GGTTGCCCAA CGTGAACTGG CTACAGGATT TTACTATGGT ACACCATCTG AAAATGAGCA 162 0 

GTTGTTTGGT GCTCGTCGTA AAATCCCTGA GTACAAGTTT GTCGCTGAAG TGGTTTCTTA 1680 

TGATGATGCG GCGGTA 169 6 
(2) INFORMATION FOR SEQ ID NO: 298: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1022 base pairs 

(B) TYPE : nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 
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(xi) SEQUENCE DESCRIPTION: SEC ID NO: 298: 

CCGAGTTTAT TATGGTTTCT TCGGAATTTA TCTCAAAGAT TGAATTTGCT TGCAATAAGA 60 

AAGAAAGTCT TTATAGTCAA AGCAAATTTA AGTATGCGAT TCGTTCGATG TTCGCAGGTG 12 0 

CATTTTTAAC CTTCAGTACT GCTGCAGGTG CAGTTGGGGC TGACTTGATT AATAAAATTG 180 

CACCAGGTAG TGGACGCTTC CTCTTTCCAT TCGTTTTTGC TTGGGGCTTG GCCTACATTG 2 40 

TTTTTTTGAA TGCCGAGTTG GTCACTTCAA ACATGATGTT CTTGACTGCT GGTAGTTTCT 3 00 

TAAAAAAAAT CTCTTGGAGA AAAACAGCTG AGATTTTACT ATACTGTACC TTGTTCAACC 3 60 

TTATCGGAGC CTTGATAGCA GGGTGGGGCT TTGCTCATTC GGCAGCCTAT GCGAATCTGA 42 0 

CACACGATAG TTTCATCTCA GGTGTTGTTG AGATGAAGTT AGGCCGCTCA AATGAATTGG 4 80 

TCTTGCTTGA GGCGATTTTG GCAAATATTT TTGTAAATAT TGCGATTCTG TCATTTATTT 54 0 

TGGTCAAAGA TGGTGGTGCC AAACTTTGGC TTGTGTTGTC AGCTATTTAC ATGTTTGTAT 6 00 

TCTTAACAAA CGAGCACATT GCGGCGAACT TTGCTTCTTT CGCGATTGTG AAATTCAGTG 6 60 

TTGCTGCGGA TTCAATTGCC AACTTCGGTG TTGGAAATAT GCTTCGCCAC TGGGGTGTGA 72 0 

CTTTCATCGG AAACTTTATC GGAGGAGGCC TCTTGATGGG TCTTCCATAT GCCTTCCTCA 780 

ATAAAAACGA AGATACTTAT GTAGATTAAG AAAATGAGCA CGATTGAGTC GTGCTTTTTT 840 

CATTTTCAAA ATAAGGTAAT AGCTATTTCT TATATCAAAA TATAGAAAAC TGATATTTGT 900 

ArACTATAAC TCAAGGTGCT ACAATATCCT TAATAAAATA ATATGGAGGT CACCTTATGA 9 60 

CTTGTGATTT TAAATnTGAA ACTCTACAAC TACATGCTGG TCAAGTTGTG GCTCCAGCTA 1020 

CT 1022 
(2) INFORMATION FOR SEQ ID NO: 299: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 663 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY : linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2 99: 

CCTTAAGTAA TCTCTGATAA TATTTTCTTT ATTAGCATAG GGGAATATCG ATATAATGGC 60 

TTCATTATGA GTGGCAGGAA TATCCAATAT GGCAACTTTT CCAATAGATA ATTTAAAACT 12 0 

CATTAATAAA GTTCCTTTAG GTGAAATGTC TATTTTCTTT GATTTTAATG CTAATTTAGA 180 
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AATAGATTCT CTCGCATTAG TTACATAACC AGATATAGGC ATATCTGATA TAGATACCCA 2 40 

AGGTATTTCA GTTCCCCAAA AAGTAGCTTC ACTGCGTGGA GGAGTTTTTC CTATTCTGAA 300 

GTTAACTAGG CTAGCAAATT TAATATATCT CCATGCTTCT GGGATTTCAT ATATAGGATA 3 60 

AGAGGTTGTT TCGTCTTTGT TCCCATAATA AGAGTTATCA TCTCCTTGGG AAACAATAGA 42 0 

AATGTCCAAA TCTTTCTTTT TAATCTTGCC TTCTTCAAAG AGTTTTTGTT TTTCTGCTCG 4 80 

TATTTTTTCA AGTAAAACTT CGACTGATTC ATCATTTGGG TCTTGTTCAA CTAATTTTCC 540 

TTGCATAGCA TATTGAAGAA TAGATTTTTT TAGTTTATCT GGAAATTCTT TATCTAGCTG 600 

TTCTAGTCTA TTATAACTTT CAGCATATTC ATCTACTTTT TCTAAAGCTG ATTCGATTGC 660 

TTC 663 
(2) INFORMATION FOR SEQ ID NO: 300: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 881 base pairs 

(B) TYPE : nuc-eic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 300: 

CGTCGCTGAA CATGTCAACA GCAAATTAAA CTAAACAAAC TAAAATTATG TGATACTTCA 60 

CATAATTTTC TTTAGAAAAT ATTATCAGAA GAAAGTTGAG AAAAATGGCA GAAAAAACAT 120 

ATCCTATGAC CCTTGAGGAA AAGGAAAAAC TTGAAAAAGA ATTAGAAGAA TTGAAATTGG 180 

TTCGTCGACC AGAAGTGGTA GAACGCATTA AGATTGCCCG TTCATACGGT GACCTTTCAG 24 0 

AAAACAGTGA GTACGAAGCA GCTAAGGATG AACAAGCCTT TGTCGAAGGA CAAATCTCTA 300 

GCTTAGAAAC AAAAATCCGC TATGCTGAAA TCGTCAATAG CGACGCAGTT GCCCAGGACG 3 60 

AAGTAGCGAT TGGTAAAACA GTCACCATCC AAGAAATTGG TGAGGACGAA GAAGAAGTTT 42 0 

ATATTATCGT AGGTTCAGCT GGTGCAGATG CCTTTGTAGG TAAGGTTTCA AATGAAAGCC 480 

CAATTGGGCA GGCCTTGATT GGCAAGAAAA CAGGTGATAC AGCAACCATT GAAACGCCTG 54 0 

TTGGTAGCTA TGATGTAAAA ATCTTGAAGG TTGAAAAAAC AGCCTAAAAA CAGAAAAAGG 600 

AGTGGGGAGG CGATGTGCTT CACTCACTCC TTTTTCCATT TTGCTACTCT TCGAAAATCT 6 60 

CTTCAAACCA CGTCAGCGTC GCCTTGCCGT ATGTATGGTT ACTGACTTTG TCAGTTTCAT 72 0 

CTACAACCTC AAAACAGTGT TTTGAGCTAA CTTCGTCAGT TTCATCTACA ACCTCAAAAC 780 

TATGTTTTGA GCTGACTTCG TCAGTTTCAT CTACAACCTC AAAACCATGT TTTGAGCCGA 840 

CTTCGTCAGT TTCATCTACA ACCTCAAAAC TATGTTTTGA G 881 
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(2) INFORMATION FOR SEQ ID NO: 3 01: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 949 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY : linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 301: 

CCTTTTTTAA TACAAGTTAT TTTGATTTAA CCGGCTTGTC TTGAGCTGTC TGCAAAGCTG 60 

TGGCAATCGT ATCTGCATAC AATTTTGCTC CTGCTTCGAT AGTGCTACTC TCACTCCCGA 12 0 

AATGAACCTG GTCTGTTCCA GCCCAAATTT CTGGATGCTC TTTCGCAACT TGATTCCAAT 180 

CTGCTATCGT AATGTAAGGT GTCTTCTCTG CCAATTCTCT CATATAGGCA GCAGCCTTCT 2 40 

CAACGATGGC ATAGGTCTCT TTTGTCTTAT CTCCCTCATA AGGAGTCACC AAAATCATAT 3 00 

GGTGTCCCTT AGGAAGATTT TTCACGATAC TGTCCCAGTC ATCCTTGTAA TTCTCAGGAT 3 60 

TATTTACCCC AGTCGCAATG ACCACCGTCT TAGGTAAAAA TTTATTCTGG CTATTATTTA 42 0 

GCATGATTTC ATTTGCGGTC TTGGTTGTTA CGCTGACCTG CGCGTTAATC TGTGCTCCAG 4 80 

GAAGAGCTGT CTGTAGTGCT GTATTTGCCC TTAAAGCCAC TGAGTCACCA ATTAACATAG 54 0 

TGCCATCAGC AATTCCCAAA CTGTTTGCAT CTGCCCGTTC TGCCATCACC TTGGTCTGGC 600 

CAATATTTGT TGCAGCTTGC TTCAAGCCAT TGACAGTCAA GTCTGTCTCA AACGCTCCCA 6 60 

CTTGTGGTGC CAACAAGGTC ACCGTGCAGA CAATGATGGT CAAGATTCCT GTACCTGCTG 72 0 

CAAGAATTGC GTGAATATAA GGCAGGGGAC GAAsGGTTTG GACAATAGGT GTGTTCTTGC 780 

CTGCAATCCA AGGTTCCAAT ACATAAAATG ACAGACTGGC AAAGCCATAA GAACAAATCA 84 0 

GAGTCAGTAA TACAGCAAGA AGATTTGATG TCAACTGTGA GAAAATGATA TAGAAAGGCC 90 0 

AATGGAAAAG ATAAACCGCA TAGCTAGTAT CCGCTAAAAA GCTGATAAT 94 9 
(2) INFORMATION FOR SEQ ID NO: 3 02: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 622 base pairs 

(B) TYPE : nucleic acid 

(C) STRANDEDNESS: double 

( D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 302: 
AAGATATATT TTTTACACAG AAGTATGCAA AAGTAAAGAG TGCAAAAAAT GGAATTAAAG 60 
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CGAAAATAAA AGCCGTGTAC AGGCGACCAA ACCAACGTAC ACGGCTAAGG AAAAATAACA 12 0 

AAACTCAAGC AAAGGCAAGG CGCGTGGTTT TGTTAGGTAT TTAGCAAGGG GACAAACCCC 180 

TTTGTAAATA ATCTCCTCTT ATTTTATCAA AATTAGAGGA AAATGACAAC TTAATTTATA 240 

AAAAGGAAAA ATGGAGGATA TAAATGGAAA TTCTGTCTAA AGAAATACAG TTACAGGGCT 3 00 

TACAACTTCT TAAACAGACT CTTGAAACTT TAGTTGAGCT AGAAAAACAA CGATCTAGTA 3 60 

AGTTAGATTT AATTTCTCGT AAAGAATTAA TGGATCTGCT AGGTATAAGT GCTACAACCC 42 0 

TTGATAACTG GGAGGATCTT GGTCTTAAAC GATATCAGAC TCCGATGGAT GGAGCTAAGA 480 

AAGTATTCTA TCGTCCGTCA GATGTGTATT TATTTTTAGC AATAAAATAG GAGTTATGAA 540 

ATGAAAATTG TTACTTTCAA ACCAACTAAA CAAATAGACG ATGGGTTTTA ACTGCCAGGT 600 

ATTGACATTC TATTTGTCTC AG 622 
(2) INFORMATION FOR SEQ ID NO: 303: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1929 base pairs 

(B) TYPE : nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY : linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 303: 

CGCTAACTTG CAAACAAAAG AAGAACGCAA ACTCCACAAA TCCTTTACGC AGAAACTCAA 60 

TCTCATCTAC TTACCTTGCT GACTTGGTAG AGTATGTTGC AGACAAAGAC TTCTCAGTAA 120 

ACGTAATTTC TAAATCAGGT ACAACAACTG AACCAGCGAT TGCTTTCCGT GTCTTTAAAG 180 

AACTCTTGGT TAAGAAATAC GGTCAAGAAG AAGCTAACAA ACGTATCTAT GCAACAACTG 240 

ACCGCCAAAA GGGTGCTGTT AAGGTTGAAG CAGACGCTAA CGGTTGGGGA ACATTTGTTG 3 00 

TTCCAGATGA TATCGGTGGA CGCTTCTCAG TATTGACAGC CGTTGGTTTG CTTTCAATCG 360 

CAGCATCAGG AGCTGACATA AAAGCTCTTA TGGAAGGTGC GAATGCAGCT CGCAAAGACT 42 0 

ACACTTCAGA CAAAATCTCT GAAAACGAAG CTTACCAATA CGCAGCTGTT CGTAACATCC 480 

TTTATCGTAA AGGCTATGCA ACTGAGATCT TGGTAAACTA TGAGCCATCA CTTCAATACT 540 

TCTCAGAATG GTGGAAACAA TTGGCTGGTG AATCAGAAGG AAAAGACCAA AAAGGTATCT 600 

ACCCAACTTC AGCCAACTTC TCAACTGACT TGCACTCACT TGGTCAATTT ATCCAAGAAG 660 

GAACTCGTAT CATGTTTGAA ACAGTTGTCC GTGTTGACAA ACCTCGTAAA AACGTGCTTA 720 

TTCCTACTTT GGAAGAAGAC CTTGACGGAC TTGGTTACCT TCAAGGAAAA GACGTTGACT 780 

TTGTAAACAA AAAAGCAACT GACGGTGTTC TTCTTGCCCA CACAGATGGT GATGTACCAA 840 
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ACATGTATGT GACTCTTCCA GAGCAAGACG CTTTCACTCT TGGTTACACT P. 
TCGAATTGGC AATTGCCCTT TCAGGTTACT TGAATGCTAT CAACCCATTT GACCAACCAG 
GTGTTGAAGC TTATAAACGT AACATGTTTG CCCTTCTTGG AAAACCAGGA TTTGAAGAAT 
TGAGCAAAGA ACTTAACGCA CGTCTATAAT AGAAGAAAAG AGTGGTTTGC CCACTCTTTT 
TACTCTCTTT ATCCATAGAA ATTGGACTCA GCCAAGACTT GTGATATAAT ATAGAAAGCA 
AAAAGGCAGA CGCCTAGATA ATAGGAGAAA CTATGTCAAA AGATATCCGC GTACGTTACG 
CACCAAGTCC AACAGGACTA CTACACATCG GAAATGCTCG TACAGCATTG TTTAATTACT 
TGTATGCGCG CCATCATGGT GGAACATTTC TCATCCGTAT CGAAGATACT GACCGTAAAC 
GCCATGTCGA GGATGGTGAA CGTTCACAAC TTGAAAACCT TCGCTGGTTA GGCATGGATT 
GGGATGAAAG TCCAGAATCA CATGAGAATT ATCGCCAGTC TGAGCGTTTG GACTTGTATC 
AAAAATATAT TGACCAACTA TTAGCTGAAG GAAAAGCCTA TAAATCTTAC GTTACAGAAG 
AAGAGTTGGC AGCTGAACGC GAACGCCAAG AAGTAGCTCC CGAAACACCA CGCTACATCA 
ATGAATACCT TGGTATGAGT GAAGAAGAAA AAGCAGCTTA CATCGCAGAA CGTGAAGCAG 
CAGGGATCAT CCCAACTGTT CGTTTGGCTG TCAATGAGTC AGGTATCTAC AAGTGGCATG 
ATATGGTCAA AGGCGATATC GAATTTGAAG GTGGCAATAT CGGTGGTGAC TGGGTTATCC 
AAAAGAAAGA CGGTTACCCA ACTTACAACT TTGCCGTTGT TATCGATGAC CACGATATGC 
AAATCTCTCA TGTTATCCGT GGAGATGACC ATATTGCTAA TACACCAAAA CAGCTTATGG 
TCTATGAAGC TCTTGGTTGG GAAGCTCCAG AGTTCGGTCA CATGACCTTG A 
CTGAAACTG 

(2) INFORMATION FOR SEQ ID NO: 3 04: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 708 base pairs 

(B) TYPE : nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 304: 

AAATTTAAGA AAAAGGAGAC ACATCATGTC TAAAAAAGTA TTATTTATCG TCGGATCACT 60 

ACGTCAAGGT TCTTTCAACC ACCAAATGGC GCTCGAAGCT GAGAAAGCAC TTGCTGGTAA 120 

AGCGGAAGTT AGCTACCTTG ATTATTCAGC CCTTCCTCTC TTCAGCCAAG ATTTGGAAGT 18 0 

TCCAACACAT CCAGCTGTAG CTGCTGCTCG TGAAGCAGTT CTCGTTGCGG ATGCTATCTG 240 
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GATTTTCTCT CCAGTCTACA ACTTCTCTAT CCCTGGTACA GTGAAAAACT TGCTTGACTG 300 

GCTATCTCGT GCCCTTGACT TGTCTGATAC ACGTGGCGTT TCTGCCCTTC AAGACAAGTT 360 

TGTCACAGTA TCATCTGTAG CCAATGCAGG GCACGATCAA CTTTTCGCTA TCTACAAAGA 42 0 

CCTCTTGCCA TTTATCCGTA CACAAGGCGT TGGTGATTTC ACTGCTGCAC GTGTTAATGA 4 80 

CTCTGCCTGG GCAsACGGAA AATTGGTTCT TGAAGAAACA GTCCTAAACT CACTTGAAAA 540 

ACAAGCTCAA GACTTGGTCG AAGCTATCAA GTAACTAACA CTCAATAAAA ATCAAAAAGC 600 

AAACTAkGAA GCTArCCGCA AGCTACTCaA gCACTGCTTT GAGGTTGTAG ATAGAACTGA 660 

CGAGTGTnnA ACATATATAC GGTAAGGCGA CACTGACGTG GCTTGAAn 708 
(2) INFORMATION FOR SEQ ID NO: 305: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 781 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) ; 


SEQUENCE DESCRIPTION: ! 


3EQ ID NO: : 








CTTCTTTTCT 


TGGAAATAGG 


TGTATAATAC 


GTTTATTAAA 


TTTTTGAGGA 


GTTGTCTATG 


60 


AAGAAAAGTT 


TTATCCATCA 


acaagaagaA 


ATTTCCTTTG 


TCAAAAACAC 


TTTTACCCAG 




TATTTGAAAG 


ATAAGCTAGA 


AGTTGTCGAA 


GTTCAAGGTC 


CTATCTTGAG 


TAAGGTCGGT 


180 


GACGGAATGC 


AGGACAACCT 


GTCTGGTGTG 


GAAAATCCAG 


TATCGGTCAA 




240 


ATCCCTGATG 


CTACTTATGA 


AGTGGTGCAC 


TCACTTGCTA 


AATGGAAACG 


CCACACCTTG 


300 


GCTCGTTTTG 


GCTTTGGTGA 


AGGAGAGGGT 


CTCTTTGTCC 


ACATGAAAGC 


CCTTCGTCCA 


360 


GATGAGGATT 


CCTTGGATGC 


AACCCACTCT 


GTTTATGTTG 


ACCAGTGGGA CTGGGAGAAG 


420 


GTTATCCCAA 


ATGGTAAGCG 


TAACATCGTT 


TATCTAAAAG 


AAACAGTTGA 


GAAGATTTAT 


480 


AAGGCTATTC 


GCCTGACTGA 


GCTAGCTGTT 


GAAGCCCGCT 


ATGACATCGA 


GTCTATCTTG 


540 


CCAAAACAAA 


TTACCTTTAT 


CCATACAGAA 


GAATTGGTAG 


AACGCTACCC 


AGACTTGACA 


600 


CCGAAAGAAC 


GTGAAAATGC 


GATTTGTAAA 


GAATTTGGAG 


CCGTCTTTTT 


GATTGGTATC 


660 


GGTGGCGAGT 


TGCCAGATGG 


TAAACCGCAC 


GATGGACGTG 


CACCAGACTA 


TGATGACTGG 


720 


ACAAGCGAGT 


CTGAGAATGG 


CTACAAGGGT 




ATATTCTTGT 


CTGGAATGAG 


780 


T 












781 



(2) INFORMATION FOR SEQ ID NO : 306: 



(i) SEQUENCE CHARACTERISTICS: 
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(A) LENGTH: 846 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 306: 

CCCGCATCTT GTAGGGTTTT AACGGGCACG ATTTTCATAT CCGTCTTGAT TGTTTTAGCC 60 

GCTTCTAGGG CTGTTTGGTA GTTGTTTTTC GCGTCCGGAT GCGCCTTTTG TTCTTCTTCG 12 0 

CTAACAGGGT TATCAGGAGC AAAGAAAATA GCAGCACCTG CCCTAGCCGA AGCTACAACC 180 

TTCTTATCAA TACCTCCAAT GTCTCCCACA TTACCATCGC GGTCAATGGT ACCTGTACCG 24 0 

GCAACAATAC GACCATTACG AAGATCTGGG TGAGCTATTT GAGTATAGAT AGCTAGACTA 300 

AACATGAGAC CAGCACTTGG ACCGCCAATA CCAGCTGTTG AAAAGCTAAT TGGGACATTG 360 

CTGATTACCT CTGTACGGTC AATCAAGCCG ATTCCAATTC CATTTTTGCC ATTTTCCAAG 42 0 

GTGATGATTT TTCCTTCTGC AGACTTGGTT TGCCCATCCT CTTCATAGGT GACCTTGACG 480 

GAATCCCCTA ATTTTTGAGA ACTGACGTAA TCAATCAAGT CTTTGGAACT ATCAAAGGTC 54 0 

TGATCATTGA CTGCTGTGAC TGTATCAGAG ATATTGAGAA TCCCTTTAAA GGTTGAATTA 600 

TCCGTCACAT TCAAAACATA AACTCCAAAG TACTTGAGTT CGATATCCTT ACCAGCTGTT 660 

TTTAGTCCTT GATACTTGGC CATATTTTGC GATGTTTGCA TGTAGAATTG ATTGATTCGC 720 

ATAAATTCAA CATCGGAAGA ACCACCTGTA GTCTCCTGAG CACTACGAAT ATCTGTAAAA 780 

GGTGTCAACC AAGCATAAAT CATATGAGCT AAAGTGGCAT GTTGAACACC AACCGTAACG 84 0 

AATTGT 84 6 
(2) INFORMATION FOR SEQ ID NO : 307: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 829 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 
(D| TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 307: 

GCGATCTGCT TGGGCTTTTC CTATTACCTT ATCTAATAAA TAGGTACGCA GACTCATAAC 60 

CATATAAAGT CCACCCCCCA TGGCACCGAC AAGAGCTACA TAAAAGAAGC TCCACAAACG 120 

TCCACTTGGT TGGAAGAAAA ATCCTAACAG CCACTGGATG GTTCCTATTA ACAGAAACAT 180 

GACTAGGGTC AGCAAACTGA TTAAAATGGT TCGCTTCAAA ATCACCTTGC GCTTGACACC 240 
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TGAAATCAAA GGACCATAAC TGTGGAAGAG GGCGATGGTA GGTAGTTGCA AGACTAGCTT 3 60 

GGCAATAGAA CCATAGATAA AATAGAGAAC GGCCTTGCGG TTGCGGAACA TGGCCTGAAG 42 0 

CATTGGAGAC AAGACCATGT ACAAGCCTAA AATAATAGAC TGCAAAACTG CAAAGACAAA 480 

TAAGCCCAGA GCCAAACTAT CTGGCTTACC ATAGAAGACC GTATAAAGAG GTTCTCCTAC 540 

CATAACCACT CCAACCGTTG CTGGTAGCAA GAACATAAAG AGTAGGGTGA GACTGTCCTG 600 

AACGAGACGA GAAGCTGCTT TCAAGTCCCC CTTGACATAG TTTTCCGTCA AAAGTGGCAA 660 

ACCAACACTC CCAATCGAAA CCCCTACAGA AATCAAAATC ATCGTGATTT TATTAGGATT 72 0 

GGCTGAGAAA TAAGAAAACA TGACAACCAA GTCCTCATTG CTGTAGTTGG TAAACCAGCT 780 

CATACTATTG ATAAAGGTCA GCTGAGTCCA AATCTGGAAG AGCTGGATG 82 9 
(2) INFORMATION FOR SEQ ID NO : 3 08: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 464 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 308: 

CGAACATCTT GCTGGCTGAT TCGTCTGCCG CCATCGCAGC CCCGAACACA TTGCGACCCA 60 

TGGCAAGCGG GCTCAATCCG CACATGGGAT CCGTGCCAAA GCCCCGCGTG TGCATCATTT 12 0 

GCTCATCTAG TAACGTATGA GGTTTGCCTT CGCTGTCGAT AAACCGATAT TCAATCGCAC 180 

CACTGCTCGT TCTCCGCGGA GGGGAAACCG ACTGCGGTAG GATGAACTCC AGAGAAGAGA 240 

GATCACGACC TACCAGGTGC GGCTCGTTGA AGCTGTTGCC GCTTAGCAGC AGGCTCGCCA 300 

CCACGCATTC CCAGAACTCA ACGGGGGTTT GATCGGCGTT CGGTTGCTGA CTAATAACTC 3 60 

GGTGCACGGG ATGCGAAGTG GCCACTTCTG GCACACCGTT CTTGTCTTCG TAGAGAGCAA 420 

TTGGGAGGGT GGCCAGCGTT TCGGCGATGA GGCGCACGCA GGCC 464 
(2) INFORMATION FOR SEQ ID NO: 309: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 982 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 309: 
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CCGTCTATAA TGGTAATAGA TTTTATTTGG AGGTTTTTAT GTCATTTCTA TCAAAAAATG 60 

GAGCAGGTAT CTTGGCCTGC CTTCTCATTT CCATCCTATC TTGGTACTTA GGAGGATTCT 12 0 

TCCCTGTGGT TGGCGCGCCC GTTTTTGCCA TTTTCATAGG CATGCTCCTA CATCCCTTTC 180 

TCTCGTCCTA TAAACAACTG GATGCTGGTT TGACCTTTAG TTCCAAGAAG TTGCTCCAAT 240 

ATGCCGTTGT CTTGCTTGGT TTTGGTCTCA ATATCTCGCA GGTCTTCGCA GTTGGCCAAT 300 

CTTCACTCCC TGTCATCCTG TCCACTATCT CAATAGCTCT GATTATTGCC TACCTCTTCC 3 60 

AGCGTTTCTT TGCCCTGGAT ACAAAACTGG CTACCTTGGT TGGAGTAGGT TCTTCTATCT 420 

GTGGGGGTTC TGCCATTGCA GCGACAGgCC CGTTATTGAT GCTAAGGAAA AGGAAGTAGC 480 

CCAAGCCATT TCCGTTATCT TTTTCTTCAA TGTCTTGGCT GCGCTCATCT TTCCAACCCT 540 

CGGCACCTGG CTTCATCTAT CCAATGAAGG CTTCGCCCTC TTTGCAGGGA CTGCGGTCAA 6 00 

CGACACTTCC TCTGTAACGG CTGCCGCCAG CGCTTGGGAC AGTCTTTACC AAAGCAATAC 6 60 

CCTCGAGTCT GCAACCATTG TTAAACTCAC ACGTACTTTC GCCATTATCC CTATCACGCT 72 0 

CTTTCTATCC TACTGGCAAA GTCGCCAACA AGAAAACAAG CAAAGCCTGC AACTGAAAAA 780 

AGTCTTCCCA CTTTTTATCC TTTACTTTAT CCTTGCCTCT CTCCTCACTA CACTACTCAC 84 0 

CTCTCTAGGT GTGTCCAGTA GTTTCTTTAC TCCTCTCAAA GAACTCTCTA AATTCCTTAT 900 

TGTCATGGAC ATGAGTGCTA TCGGTCTCAA AACCAATCTG GTCGCTATGG TCAAATCCAG 960 

TGGAAAATCC ATTCATCATG GA 982 
(2) INFORMATION FOR SEQ ID NO: 310: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 1939 base pairs 

(B) TYPE : nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 310: 

CTAGCTGCCA ATATGATTGG GGTGCAGAAG CGCGTGATTA TCTTTAATCT TGGCTTGGTT 60 

CCTGTGGTCA TGTTTAACCC AGTGCTTCTG TCCTTTGAAG GATCCTATGA GGCAGAAGAA 120 

GGCTGTTTGT CCTTGGTAGG TGTGAGATCA ACTAAGCGTT ATGAAACCAT AAGGCTTGCC 180 

TATCGTGACA GCAAGTGGCA GGAACAGACC ATTACCTTGA CAGGCTTCCC AGCTCAGATT 240 

TGCCAGCATG AGCTGGATCA CTTGGAAGGA CGAATCATTT AGGAGGAAAG CAAATGAAAC 300 

GAATAGTCTT TGAACTTATT TTTATCGCAA CGACCTGGTA TATCTTTTTA CCGCCCCTTA 360 
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ACCTGACCAG CTGGGAATTT CTCTTCTTCC TCTGTGGGCA TTTGTTAGTT GTGGCAATAT 420 
TATTTGGCTT TGGCAAGGGG ATAAACCTTG TCAAAACGGT TCATGTGCGC CACGGTAAGG 480 
CGGAAGCTGC CTTAAATCTT GAGGGTTTCA AAATCAATCG GTTAGGGAAA ATTCTGTTAG 540 
CTTCGATTGG AGGAATTCTT CTCTTGGCAG CTTTGGTTTc CTTGGTAACT TCCAGCATGT 600 
TTCAGGCTAA AAATTATGCC AATGTAGTCA CGGTTACGGA AAAAGACTTT ACTGAATTTC 660 
CTAAGAGTGA CACCAGTAAG GTTCCTATCC TAGATAGAAG TACTGCTGAA AAAATTGGAG 72 0 

ACCGCTACTT GGGTTCCCTA ACCGATAAGG TGTCGCAATA CGTAGCGGCA GATACCTATA 780 
CCCAATTGAC AATTGATGGG AAACCTTATC GGGTCACACC ACTAGAATAT GCAGACCCTA 840 
TCAAATGGTT TAACAATCAA GCCAAGGGAA TCGGTGAGTA TATTAAGGTG GACATGGTAA 900 
CTGGAAATGC GGATTTGGTG GACTTGAAGA CACCAATCAA GTATTCAGAC TCGGAGTATT 960 

TTAACCGTGA TGTCAAACGT CACCTGCGCT TGAAGTACCC GACCAAAATC TTTAAAACTC 1020 

CATCTTTTGA GGTGGACGAT GAGGGCAATC CTTTCTATGT AGCAACGGTT TACCAAAAGC 1080 

AATTTGGACT TGCTGTTCCT CGTCCTGCTT CAGTCATTAT CTTGGATGCT ACAAATGGAG 1140 

AAACCAAGGA ATACAGCTTA TCAGATGTTC CAGAATGGGT GGACAGGATC TATCCAGCAG 1200 

AGGAAACCAT TGAGCAAATC AACTACAACG GCAAGTACAA GGACGGTTTC TTGAATGCCA 12 60 

TGATTTCCAA GAAAAACGTG ACCCAGACTA CCAATGGCTA TAATTACTTG TCTATCGGTA 1320 

ATGACATCTA TCTCTACACA GGTGTGACGT CGGCTAATGC GGATGAGAGT AATCTTGGTT 1380 

TCATCCTTGA AAATATGCGA ACAGGAGAAA TCACTAAGTA TAGCTTGGCT TCTGCGACAG 1440 

AAGAATCAGC CCGTGAATCA GCAGAAGGTG CTGTTCAGGA GAAATCCTAC AAAGCAACCT 1500 

TCCCAATCCT CATCAACCTC AATGACAAGC CTCTCTACAT CATGGGCTTG AAGGACAATG 15 60 

CTGGCTTGGT CAAAGAGTAC CCCCTGGTAG ACGCAGTCGA GTACCAAAAT GTTATCGTTG 162 0 

CTACTACAGT GGAAGAGATG CTCAGCAAGT ATGCCAATAA AAACGACCTT GAAATTGACA 16 BO 

ATGCAACGAC AGAAAGCATC AATGGAGTAG TAGCAGACCT CAAATCAGCT GTTATCAAGG 1740 

GAGACACTGT CTACTTCTTT AAAGTTGATG GCAACATCTA CAAGGTCAAG GCTTCAGTAT 1800 

CCGATGACCT TCCTTACCTT GAAAATGGTA AAACCTTCGA AGGTCAAGTA GGAAAAGACA 18 50 

ATTATCTCAA GACCTTTAAG CTACGGTAAA AATAGGTTTT TTTCAGAAAG TATATGTTAT 1920 

AATAAGGTAA ATTAAGCCG 193 9 
(2) INFORMATION FOR SEQ ID NO: 311: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 907 base pairs 

(B) TYPE: nucleic acid 
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(C) STRANDEDNESS : double 

(D) TOPOLOGY : linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 311: 

CCTGCTAATA GAGAGAAAGA CTAGGAGTAG AAGTAAGCCA ATTAAATAAT GAGAAAGTTT 60 

CATACCCCGT CCTTTCATGT AGATTTGGTA TCGAAAGATA TCTGCGGATA TAAATGTAAC 12 0 

ATTATTTTTC TAATCTGTCA ATAAAATTTC TGACAATTTA ATAAATACAA CAAGGAGAGA 180 

GCAACAAGAC TTTCTCCTTT GTTATCCTAT TCTAAAATGT TTTTACCTTA AT C T GAT AAA 24 0 

ATAATATCTT CGAGGGAGTA GCTAGCCGTC CAATCAAGAT ATTGTTTAGC TTTTGAAGCA 3 00 

TCTGCTAGGA CACTGGCTGG GTCACTAGCA CGTCGAGCAA CAATCTCGTG TGGGATTTTT 3 60 

TAATTTAGTA ATTCTTCAGC AGTTTTAAAG ATTTCTTTGA TAGTATAGCC TTTTTTAGTT 42 0 

CCTAAGTTAA AGATTTGAGA AGAACTGTCT TCTTGAAATA GGTAGTTCAT TCCTTTAACA 4 80 

TGAGCCTATG CAAGGTCCAA GACATAAATG TAATCTCGAA TACATGAACC GTCACGTGTA 54 0 

TCGTAGTCAT CTCCAAATAT TTTTAAGCTA TCATTTTGTC CCAATGCGGT CTTGTTGATA 60 0 

TTTGGAATGA TGTGAGTTGG ATTTTTCACA CGCAGACCGT TTGAAGCATC CATTTCAGCC 6 60 

CCAGCAACAT TAAAGTAACG GAAAATAACA TATTTCCAGT CGTAGCGATT GGCCATCCAG 720 

TAAATCATTC GTTCGCCCAT CAGTTTTGTC TCTGCATAAG GGTTGACAGG GTCGAGCAGG 780 

GTATCTTCAG TCACCGGCTT GTCAATACAG TTATTTCCAT AGAGAGAAGC AGTCGAAGAG 84 0 

AACATGATTT TTTGAATGCC AACTTCAGAT AAGACTTTGA GAACTTGGTT CATACCAGCA 900 

ACGTTGG 907 

(2) INFORMATION FOR SEQ ID NO: 312: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2170 base pairs 
(K) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION : SEQ ID NO: 312: 

C C AC AT AAAG GTAAATATCT TTTGTACTAT CTTGGGCATC CAAGAAAAGC AATTGGGCAA 60 

TAACAGAGTT AGCCATATTG TCTTCAACCG GACCTGTCAG CATAATGATG CGGTCTTTGA 12 0 

GAAGACGTGA GTAAATATCG TAAGAACGTT CTCCACGGCT TGTTTGTTCA ATAACTACAG 180 

GAATCATTCA TTTCTCCTTT TGAGTTTTAA TTTTGTTGGT CAAATGACTG AAGATAAGAC 240 
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TATTATAATA TCTTGGTCAA AAAAGGTCAA ATTTTTGCTC TGCTTTCATT AGACAGAAAC 300 

AAAAACCCAA CCTCCTTTCG TGACTGGAAA TACTTTTCCA AGTCATTCTT CTTTTCGATC 360 

TTATTTTGTA CCGAACAAGC GGTCTCCAGC ATCTCCAAGA CCTGGAACGA TATAACCGTG 420 

TTCGTTCAAA CGTTCATCCA AGGCTGCTGT AAAGATTTCT ACATCTGGAT GAGCTTCTTG 480 

AAGGGCTTTT ACACCCTCTG GAGCAGATAC AAGGCAGACA AATTTGATAT TTGATGCGCC 540 

ACGTTTTTTA AGAGAATCAA CAGCCAAGAT TGCTGAGCCA CCTGTTGCCA ACATTGGGTC 600 

TACTACAAAA ATTTGACGTT GGTCAATGTC CTCAGGCAAT TTCACCAAGT ATTCAACTGG 660 

TTGAAGTGTT TCTTCATCAC GGTACATACC GATGTGGCCA ACTTTAGCAG CTGGAACCAA 720 

GTTCAAGAGA CCATCAACCA TCCCGATACC TGCACGCAAG ATTGGGACGA TGGCCAATTT 7 80 

CTTACCTGCC AATTGTTTTT GAACTGTTTT TGTAATTGGT GTTTCGATTT CCACATCTTC 840 

TAGTGGAAGA TCACGAAGTA CTTCATACCC CATCAACATT GCAATCTCAT CTACTAGCTC 900 

ACGAAAAGCT TTTGTAGAAG TATCTGTACG ACGCAAGATT GACAATTTGT GTTGAATCAG 960 

TGGGTGATTA ATAACTTCAA TTTTTCCCAT TTTTGGAATT CCTTCTTTCA ATTTATTCTT 1020 

CTTATTATAC CAAAAAACGG TTTAAAAATC TTTCTAAACC ATTTATTTTT GATAATTTTT 1080 

ACATTAGATC AGCCTCTTTA AGAGCTGTCT GTACTGTCTC AAGTGGTAAA TGGGTCAATT 1140 

CTGTCCCTTT TTCTTGATAA AGGTATTGGG CGTAGTCGTC CATTCGGTAC TGGTTGATAT 12 00 

AAACCACGCG CTTGCAGCCG ACCTGAAGCA ATTGTTTTGT ACAGTTGAGA CAAGGAAAAT 12 60 

GGGTTACATA GGCTGTAAAG CCTTTGGGAA CACCACGCTC AGCACCTTGA AGGATAGCAT 1320 

TGACCTCAGC GTGAAGGGTG CGAACGCAGT GGCCTTCAAT GACCAAACAT TCGTGATCAA 13 80 

TACAATGCTC AGTCCCTGAC ACCGAACCAT TGTAACCAGT GGAAATAACC TTATTATCTT 1440 

TTACCAGAAT CGCGCCCACT TTAGCACGTT TACAAGTGGA ACGATTCGCA ATTAGTAGAG 1500 

CTTGGGCTGC AAAATACTCA TCCCAGGCCA GTCTTTTTTC AGTCATCTCT TTTCTCCTTT 15 60 

TTCTCTATTT TTTAAAAAAT GGTAAACCTA AATCTGCAAT CTTTTCAGCT GGTACCTTCA 1620 

TGCCATCCTT GATCCATTTT AGAAGGACAG AGACGATGGC TGAGCTCCAG AAGGAATGAA 1680 

GATAAGAGCT GACACCTTTT GATTTCCCAT GGTATTTTTC TAGAAATTCC TGCATGGCTT 1740 

GGACAAAGAT TTTTTCCAGA TGGTAATCCA AGGCCAATTG AATTACTCTA GCTTCCTTTC 1800 

TGGCCTCCCG GAAAAGGTGA ACCCAAACCA AATAAAGGTC TGTCTTTAAA TCGTAATGAT 1860 

GCAGCTGTTC CATAATATTG TGGACAGTTC GTTTAAAGAC GCTCTCTAAA ATTTCCTCTT 1920 

TGGAGTCATA ATTGCGATAA AAGGCCGCAC GCGAAACACC TGCACGTTTG ACCAATTCAG 19 80 

AAATACTAAT CTTGGTCAGT TCCTTTTTTT CCAAGAGTTG CAAGAGGGCT GTTTCAATGG 2040 
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CTTCTCTGGT TAATAAATTG GATTCTTGGT TTGATTTTCT GAGATTTTCA AGAGACTTTT 2100 

CAGAGATTCT ACGTTCAGAC ATAACATTTT CTTTCTACTT GTCACAACAG ACGGATGATG 2160 

CTTTTGTTTC 217 0 
(2) INFORMATION FOR SEQ ID NO: 313: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 53 9 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 313: 

ATCTGCACGA ATCAGGGCTT TCTAAGTGAC TATTTCCACC GAAATATTAT TTATATCAGG 60 

AGGACATTCA TATGTCACGT TATACAGGAC CATCTTGGAA ACAAGCTCGT CGTCTTGGCC 12 0 

TTTCACTTAC AGGTACAGGT AAAGAATTGG CACGTCGTAA CTACGTACCA GGACAACACG 180 

GACCAAACAA CCGTTCTAAA TTGTCAGAAT ACGGTTTGCA ATTGGCTGAA AAACAAAAAC 24 0 

TTCGTTTCAC TTACGGTGTA GGTGAAAAAC AATTCCGTAA CTTGTTCGTA CAAGCTACAA 3 00 

AAATCAAAGG CGGAATCCTA GGTTTCAACT TTATGCTTCT TTTGGAACGT CGTTTGGATA 3 60 

ACGTTGTTTA CCGTCTTGGT CTCGCGACTA CTCGTCGTCA AGCTCGTCAA TTCGTAAACC 42 0 

ACGGTCACAT CCTTGTTGAC GGGAAACGCG TTGATATCCC ATCATrCCGC GTAACTCCAG 4 80 

GTCAAGTGAT CTCAGTTCGT GAAArATCAT TGAAAGTTCC AGCAATCCTT GAAGCAGTA 53 9 
(2) INFORMATION FOR SEQ ID NO: 314: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 667 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 314: 

CCGGTTTTGC TCCTTCTCTA CGGCTACGAC GTGATGTATC TCTGATGATA TCCACTGTTT 60 

CTGTAGCAGG CGTAGGTGTT TCTGGACCTG CTTGTTCTGC TTTTTTCTCT GCCGTCGTAT 120 

AGGAAACAGC TACCCTTGTT GGGGTTTCAT TGTATTCTCT TTCAAGTTTC TTAGGTCTAA 180 

CAGGACCTGG ACCTGGTCTT GATCCACTTT CTTCCGCTGG AGAAGAAGGT ACATCTTGAC 240 

TTGGATGACT TGGAACACCA GGAGTTTCTC TTTGAATCTC ATCTGCTGGA GAAGCTGGTA 300 
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CCTCATCTTC TTTCAGAACT TCATCATAGC CTTTTACTTT TTCTAAATCT CTCAGAATCT 
GCTCTTTAAA GCGTAATTTC TCTTCTGCTC TTGACTTTTC ACTCAAAAGT TTTTCCTCCT 
TGTTGAGAAT CCATAATATT AGAGCTGAGA AGTCCAAAAA AAGCAATCTA TGATACTTTT 
CCTAACGGAT TTTGTCATTT CCCAGACCAT ATCATACCAT GTTTCCCCTG CAAAGGTTGA 
CTGGGAA 

(2) INFORMATION FOR SEQ ID NO: 315: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 1483 base pairs 

(B) TYPE : nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 315: 
GGGAAGCCAA GGTATTTTAT CGGATGAAGT TGTTACTAGT TCTTCACCGA TGGCTACAAA 
AGAGTCTTCT AATGCAATTA CTAATGATTT AGATAATTCA CCAACTGTTA ATCAGAATCG 
TTCTGCTGAA ATGATTGCCT CTAATTCAAC CACTAATGGT TTAGATAATT CGTTAAGTGT 
TAATAGTATC AGCTCTAATG GTACTATTCG TTCCAATTCA CAATTAGACA ACAGAACAGT 
TGAATCTACA GTAACATCTA C T AATGAAAA TAAGAGTTAT AAGGAAGATG T 
CAGAATTATC AAAAAAGAAT TTGAAGATAC TGCTTTAAGT GTAAAAGATT f 
AGGTGATGGG ATTCATGATG ATCGACAAGC AATTCAAGAT GCAATAGATG CTGCAGCTCA 
AGGGCTAGGT GGAGGAAATG TATATTTTCC TGAAGGAACT TATTTAGTAA AAGAAATTGT 
TTTTTTAAAA AGTCATACAC ACTTAGAATT GAATGAGAAA GCTACAATTC TAAATGGTAT 
AAATATTAAG AATCACCCTT CCATTGTTTT TATGACAGGT TTATTTACGG ATGATGGTGC 
GCAAGTAGAA TGGGGCCCAA CAGAAGATAT TAGTTATTCT GGTGGTACGA TTGATATGAA 
CGGTGCTTTG AATGAAGAAG GAACTAAAGC AAAAAATCTA CCACTTATAA ATTCTTCAGG 
TGCATTTGCT ATTGGGAATT CAAATAACGT AACTATAAAA AATGTAACAT TCAAGGATAG 
TTATCAAGGG CATGCTATTC AAATTGCAGG TTCGAAAAAT GTATTAGTTG ATAATTCTCG 
TTTTCTTGGG CAAGCCTTAC CCAAAACGAT GAAGGATGGG CAAATCATAA GTAAGGAGAG 
CATTCAGATT GAACCATTAA CTAGAAAAGG TTTTCCTTAT GCCTTGAATG ATGATGGGAA 
AAAATCTGAA AATGTGACTA TTCAAAATTC CTATTTTGGC AAAAGTGATA AATCTGGGGA 
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ATTAGTAACA GCAATTGGCA CACACTATCA AACATTGTCG ACACAGAACC CCTCTAATAT 1080 

TAAAATTCAA AATAATCATT TTGATAACAT GATGTATGCA GGTGTACGTT TTACAGGATT 114 0 

CACTGATGTA TTAATCAAAG GAAATCGCTT TGATAAGAAA GTTAAAGGAG AGAGTGTACA 12 00 

TTATCGAGAA AGCGGAGCAG CTTTAGTAAA TGCTTATAGC TATAAAAACA CTAAAGACCT 12 60 

ATTAGATTTA AATAAACAGG TGGTTATCGC CGAAAATATA TTTAATATTG CCGATCCTAA 13 2 0 

AACAAAAGCG ATACGAGTTG CAAAAGATAG TGCAGAaTwT TTAGGAAAAG TATCAGATAT 13 80 

TACTGTAACA AAAAATGTAA TTAATAATAA TTCTAAGGAA ACAGAACAAC CAAATATTGA 1440 

ATTATTACGA GTTAGTGATA ATTTAGTAGT CTCAGAGAAT AGT 1483 
(2) INFORMATION FOR SEQ ID NO: 316: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH : 2453 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 316: 

CCTGAACGCT TTTTTATAAA TATCATAAAG CCAATCTGAT TTATCAAGTG TGTCTAAGCG 60 

ACGCGAATTA AAATTCATTG CATACTCCAT CGCTTCTAAA AAACTCATTT TTGAAAAGAC 120 

GTTAAAATCA TCTAAATTCT GACTCCAATA TAATAACAAA ACCAATCCCA TAATATCCTC 180 

TGGTTGATTA TTCAATAAAT TTAAGTTGGT TTCATAAAAC CCTGGAGTTC CAAATAGAGG 24 0 

CAACTTTTTT TCTTCAATTT GAGTTTCTTT CCTTAGGGCA TGCTCAAAGT CTATAATATA 3 00 

AATATTATTT CTATTATCAA TAAGTATATT ATTAAATGAT AAATCTCTAT AGGAAAGATT 360 

ATATTTGGAG TTTATTATCT CCATATAATC AATTAATGTT AAAAACCAAT CATACGAGCC 420 

ACTAACCATA TTATACTCGC TTAATTTATC TGCAATAATA AACTCAAATT CCACAAAATA 480 

CGAATTCTTT ATGTAAAAAT CGTTAAAAAC TTTTGGAGTA AATTCCTCCT TTTCCAATTC 540 

TACTAATATT TCTCTTTCAT TTATTAAACG ATTCACAGAA TCTCTATTTG TAAAATCAAC 600 

CAACGATAAA TCACTAGCTT CTTTTAATAA AGAATAAACT CGCTTTTGAG TATTAAATAC 660 

TTTATAAACT CCACCTTTGG CATTTTTAGA AATCACTTCC AAAATAATAT ATTGATCAGG 720 

AATAGTGTTA TATCTTGGAA TATAGTAATC CCTTATTGGA ACATTCACAT TTGAAGGGAT 780 

TTTCTTATCT CTTTTATCCT TGAAAGTGCT ATCTTTTACG AACTCCCCAT ATCTGTAATA 840 

TACAACCTCG CTAAGTTGAA ATCTGAAATC TGATGGTATG TTTACACCCT TTACACCTTT 900 



WO 98/18931 



PCT/US97/19588 



1344 

ATACAATATT TCTAATTTGT GTAACAAACG TTGAAACTCT TTATTATCTT TTGGATAAAT 960 

TGTAATGAAT TTCCCGACTT GTGAATAACC ATTAAGCCCT GTATTTTGCA AAGAAAGTTC 102 0 

TTTAATGCTA ACCAAAATTT TGAAATTTAT CTTCTTCTCT CTAGAAAATA TAAAATCAAA 1080 

GAATTTTTTA GCAACCAAAT TAGCATTTAA TATTGAAGCG CTCAGGTGTA TTTTAAATCC 1140 

CTTAGATTGG GTGATATTAG ACGGCAAATT ATATAACCAA TGTTCATCAC TAAAATTATC 12 00 

ACTAATTTTA TATTCTAATA ATAAATTATG GTATGCGTCT TCTATTTCAG TTTCATAGTC 12 60 

CAAATAGTTT AAATACTTTT CGTAATTCAT ATTAAGAAAT CTTCTCCATA AATTTTTAGA 1320 

CCATCATTTA AAGCCAAACA ATTTAAAGCG TGATAATAAA TGTTGATAAT CAATGTAACT 1380 

TTCAGTCCTC TATTTTGTAA TTCCTTCACC AATAATTTTA TGCTATATCT ATTTTCTCGA 144 0 

GGCAATTTAT AGGACTTCAA GATAAAACCA TAAAAGAGAT AAGTATTATA ATCTGACAAT 1500 

CCAGTTTCAG AATAATTTTT TAGAAAAATA TCTAGTGATT CTGATAATTC ATCCGGAATA 1560 

ATTCTTTTAA CATCGTATTT ATTTTTCATA TCGGCCACTC TTCCTTAAAA AGCTCACAAT 162 0 

AAAATTTTAA ATTTCTATAC AACAATCCGA GAGTAGTCTC ACAATTTGAA CATTTCACAT 1680 

CACTCTTAAT ATATAAAAAA TGAATTAATC AGAAACCTCT GACTAAGATT TCCTAATTAA 174 0 

TTCACTTTCT ATATCATAGT AAGGAATTCT ATTATCCCTA ATTGAAAATT GAAATTTTAT 1800 

GTTTTATATA TTAACAATTA TGCGGATTGT AAATCTTGTC TAACAAAATG GCAAGTGCTA 1860 

CTATGTGCCC CAGAAGGCGA TGCAACGCTA TTTTGAATTG AAAGAGCATA ATCATCCATA 1920 

TCATTTAAGT CACGGATTAG CAATGCTTCC TTCTCTCTTC CGACAATTCC AAATTTTCTA 1980 

ATTACCTTTT CAGGATTATC AAAAAATTCT CCAACAACTT CCATATTTCC TTGAAGTTCA 2 040 

TTCAAGAAAG CTTTCATTTG ACTACTCATT ATATAGCTCC TTTTCTATTA CTTTATTTGG 2100 

AATCAAAACT TACTTGTACA TTGGAAACAC CTCTATTCTA CGCTTTCATA TTGCTGCATG 2160 

ACACTTTCAA AATCAAATTG CTAAAAATAA TTTTTTAAAG CTTAATTTAG ATTTAATTAC 2220 

ATATATCTCA AAAAATTGTT TTGAAATTAG TAAATTAAAA TAGGTTTCTG TACTTATAGG 2280 

AACTAGTTAT AAAAACTTCG CCCATCATAA AATATCTATT TAAGTAAAAC AAAAATTTTA 2340 

TAATTTTTTG ATTTTTAAGT GACTATAATC TCCTATCTAT AAATACCATT CGCAGGACCT 2 400 

GGATCAATCC CTCTAGCCAT CTTATGAACT TGAGTTCCTC CAGACAGTCC CGG 2453 
(2) INFORMATION FOR SEQ ID NO : 317: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 1049 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 317: 

CCAATTTGAA GGCTCTAAAA CAATGGAAAA GTGCTACACA GATGTGACAG AATTTGCCAT 6 0 

TCCAGCAgTA CTCAAAAACT TTACTTATCA CCAGTTTTAG ATGGCTTTAA CAGCGAAATT 12 0 

ATTGCTTTTA ATCTTTCTTG TTCGCCTAAT TTAGAATAAG TACAAACAAT GTTGGAACAG 180 

GCATTCAAAG AGAAGCACTA TGAGAATACG ATTCTCCATA GTGACCAAGG CTGGCAATAC 240 

CAACACGATT CTTATCATCG GTTCCTAGAG AGTAAGGGAA TTCAAGCATC CATGTCACGC 300 

AAGGGCAACA GCCCAGACAA CGGCATGATG GAATCTTTCT TTGGCATTTT GAAATCGGAG 3 60 

ATGTTTTATG GTTATGAGAA GAACTTTAGA TCTTTAGAAA ACCTTGAACA AGCTATTGTG 42 0 

GACTACATTG ATTATTACAA CAACAAGAGA ATTAAGGTAA AGCTAAAAGG ACTTAGCCCT 480 

GTGCAATACA GAACTAAATC CTTCGGATAA ATTAATTUTC TAACTTTTGG GGTGCAGTAC 54 0 

ATTTTTGGTA TATATAAAAT TTGTAGGAGC TATATCTACA ATTTTATATT CCCAGTTTAT 600 

GGATGTAACT TACTATATTC ACAATGTTAT CCAGTGTTTT TTCTCTAATA TTTAAGGAGT 660 

GTTCTGTTTC TCGAATAAAT TCTTCAAAGT TTAACCCGTC AACTTGTTCC TGAACAAGAA 72 0 

AATAATCATC CACGATATAA AATTCATCAG TTAAATTAGT AGTATAACTT TTATCGGCTA 7 80 

ATTTTTTTAG CATGTGAGCT TCATTTTTTA TATCATCAAG AGCTGTCCAT TCTCCTTCAG 840 

CATCATAATT CACAAAAGGT CTTGACTGCT TGATGATTAC TTTTTGCCCG TCCGATTTTC 900 

TAATTGCCCG ATAAACATTT CCTTTATTTG ATCTCTTAAT AATTTTTTCC ATTTTGTATT 960 

TATTTATTGC AGAGTCCTTA CTTGAAACTT CACATGTGGT TTGAAAATAA ATCCTTTTTT 1020 

CTTCTTCTGA AAATAAATCC ATTTTCCGG 104 9 
(2) INFORMATION FOR SEQ ID NO: 318: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 776 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY : linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 318: 

TTAGTTGGTT AGAATCAGAA AATCGCCGAA GTGGTTATTT ATTTTTGAAT AAATTTAACG 60 

AACCAATTAC AGCAAGAGGA GTTGCTCAAC AGTTAAAAAA TTATGCTGAT AAATACAAAA 120 

TGAATCCTAA AGTAATTTAC CCTCATTCTT TTAGGCATTT ATTTGCTAAG AATTTTTTAG 180 
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CGAAGTATAA TGATATTCCC TTGCTTGCAG ATTTGATGGG ACACGAAAGT ATAGAAACTA 240 

CTCGAATTTA TCTAAGGAAA ACAGCTACTG AACAACAAAA TATTGTAGAT AAAATTGTTA 3 00 

ATTGGTAAAA AATAACAGGT GGTCAAACTG ACTACCTGCT ATTTTTGTGA TTATGGCTCT 3 60 

TATTATGGGA ATATACCTAT GAATTGGGTT GTTATAAAAA TAAAAGATAT TTTTTCAATA 420 

AATACAGGTC TTTCTTACAA GAAGGGCGAT TTAAGCATTA ATAATAAAGG TGTTAGAATT 480 

ATACGTGGTG GTAATATTAA GCCTTTAGAA TTTTCTCTGT TGGATAATGA TTACTACATT 540 

GATACACAAT TCATCTCCTC TGAGCAAGTT TATTTAAAAC ATAATCAGCT AATAACACCT 600 

GTATCAACCT CTTTAGAACA TATTGGAAAG TTTGCAAGAA TCGAGAAAGA CTATGATGGT 660 

GTTGTGGCTG GTGGATGTAT TTTCCAATTA ACACCATTCG AAAGTGCAGA GATGATGTCA 72 0 

AAATGTCTAT TATGTAACTT GTCCTCTCCG TTATTTTATA AACAATTGAA AGCAAT 77 6 
(2) INFORMATION FOR SEQ ID NO: 319: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 658 base pairs 

(B) TYPE : nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 319: 

TGCAATGCGG CGGCTGCATA CGCTTGATCC GGCTACCTGC CCATTCGACC ACCAAGCGAA 60 

ACATCGCATC GAGCGAGCAC GTACTCGGAT GGAAGCCGGT CTTGTCGATC AGGATGATCT 12 0 

GGACGAAGAG CATCAGGGGC TCGCGCCACC GAACTGTTCG CCAGGCTCAA GGCGCGCATG 180 

CCCGACGGCG AGGATCTCGT CGTGACCCAT GGCGATGCCT GCTTGCCGAA TATCATGGTG 24 0 

GAAAATGGCC GCTTTTCTGG ATTCATCGAC TGTGGCCGGC TGGGTGTGGC GGACCGCTAT 3 00 

CAGGACATAG CGTTGGCTAC CCGTGATATT GCTGAAGAGC TTGGCGGCGA ATGGGCTGAC 3 60 

CGCTTCCTCG TGCTTTACGG TATCGCCGCT CCCGATTCGC AGCGCATCGC CTTCTATCGC 42 0 

CTTCTTGACG AGTTCTTCTG AGCGGGACTC TGGGGTTCGA TGTCGACAGC CCGCCTAATG 480 

AGCGGGCTTT TTTTTCCTGA GGCTGGACGA CCTCGCGGAG TTCTACCGGC AGTGCAAATC 54 0 

CGTCGGCATC CAGGAAACCA GCAGCGGCTA TCCGCGCATC CATGCCCCCG AACTGCAGGA 600 

GTGGGGAGGC ACGATGGCCG CTTTGGTCCC GGATCAATTC GCGCGACCGG ATCGATCC 6 58 
(2) INFORMATION FOR SEQ ID NO: 320: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 1475 base pairs 

(B) TYPE: nucleic acid 
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(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 320: 
CCGGCTTAAT TTTTAGAAAA CGTGGGCAGG GAACCTTTGT TCTCTCTCGT GGCAGCTCAA 
AAAGAAAATT AATCGTTCCA GAAAGAGATA TCCGGGGACT GACAAAAATA TCTGAAGATG 
CTCATTCTAC AATTGACTCG AGGATTATTC ACTTCAAATT AGAATTTGCA AATGAATTTT 
TAGCAGAAAA ACTACAGGTC GCTTTGCAGA GTCCAGTTTA TAATATTTAC CGCCTGCGTA 
TTATTGACGG TAAACCTTAT GTTCTGGAAC AAACTTATAT GAGTACCGAT GTTATTCCAG 
GTATTACTGA AGATATTTTA CAAAAATCGA TTTACAATTA CATTGAAGGA AAGTTAGGAT 
TGCATATTGC CAGTGCTACA AAAATCTTAC GAGCTTCTTC TAGTTCAGAA AATGAGCAAC 
ATTACTTGCA GCTCCTTCCA ACGGAACCGG TATTTGAAGT AGAACAAGTG GCTTATTTGG 
ATAACGGAAC TCCGTTTGAG TACTCGATTA GTCGTCATCG CTATGATTTA TTTGAATTTA 
ATTCTTTTGC ATTACGACAT TCCTCCTAGG AGAAAATGTG AAAATGAAGC CAATCTTTTA 
CAGACTCTAG TTTAAGAAAA ATTTAAAACA GGGCAAGAAG GTCCCATCTA TGCTTAAATG 
GTTTCTCTTT TCTAAATAAG ATGGCTTTAA AAGAGTGATC GTTGTATCCA TCATGTTGAA 
AAATATCTTC GTATAGCTTA TAGAGTAGGT ACTGAAATTG TTCACCTGAT CTACTTCTTA 
TAGTTATTTA GTTTTAAATA GTGTTTCAAA CATTCTTACA CTGACGAGAA GTTTTTGAGT 
CTTTTCTTGT AACACATATA GTATACTGTG GTTAGAATAG TAGACTGTGA CTTCTAACAA 
ATTGCTAGAA ATGAATTTCA ATCTCCCAAT TTATTTGTTC ATATCTTCTT TTAATATATT 
AAATAAATTC TAAATCATAA TCATTTAAAA AAATTTTATT TTTTATTTTT CATTACGAAT 
AATATAGATG AAGGGGAAAG AGTATGAAAA CAGAACTGTT TCTTTTGCTA TTAGTTCAAA 
AGGAGAAAAA ATGAAAGTAG AAAATATTTC GTATAGGGTU GATCATCGTA AATTGTTTGA 
TAATATTTCT TTTGATACTT CGAGTTCAGA CGTGACATTA ATTACTGGTA AAAATGGTAC 
AGGAAAGTCA ACTTTACTAT AGTAGATTGA AACTAGAATA GTACACATCT ACTTCTAAAA 
TATTGTTAGA AATCGATTTG ACTATCCTGA TCTATTTGTC CTGTTCTTAT TTCATTTCAC 
TATATCTCAA ATTGAGTATG ACGAAGTGCG CTCCCATGTC CTGGGAACGC ACTTTCTTCA 
TATTTTTCAT ATTCTTGAAT CCATCGATAA AGACTATTGG GATGAATTTT TAAAGTTGAA 
CTAATCATTT TTACAGGATG AGATTTACAG CAGAG 
(2) INFORMATION FOR SEQ ID NO : 321: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 560 base pairs 

(B) TYPE : nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION : SEQ ID NO: 321: 

GAAATATATA TACTTCATCT TAATAGTGAG CAAGCTAAAC TTAGCATTTC ATGCCCTCAT 60 

ATGGGATGTT CTTTGACTAA ATAATATGAT TATCGAGATA TATCTGGATA AATGAACTAA 120 

TAAGTCTGAC GCGTAGACTT ATCAAAGTCA TTGGCATACA CCACTATGAA CTCGTTGGTC 180 

TGTTCAAATC CCAACACATT ACCTGAGAAG AAAGTTGCAA TGTTGTTTTT GGTGCGGGTT 2 40 

TGAATTTAAA AAATTTGTTA TGTAGTACCT AATCTAAGGA ATTAGAACAA TGCCTCTAAT 3 00 

TTTTCTTTAA TACACTGAAA CATTGATGAT TCTGGCTGTA TTTTTGAAAC AGCTCTTCTT 3 60 

TGCTCCTGGA AAATATCTTC AGAAGTTATA TTCTCTATTC CTAACGCTAC TTGAGTTTTT 420 

TTTCTAAAAT ATTCTTTTCC GTTGCCATCT TTAGAAAAAT CATAACCTTC CCTATCTACG 4 80 

CTGTTACACA AATTAGCTAA AAAArACTCT GGGGTTGGGA AAGGAAGATA AGAAaCGTAT 540 

TTAGCCCATA ATCTATAAAG 550 
(2) INFORMATION FOR SEQ ID NO: 322: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH : 643 base pairs 

(B) TYPE : nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 322: 

CCGCCCGGCC ACCGCTGCCT ATCCTCGGGA GAGGGTCACC TGGAGTGAAC CTAGAACGAT 60 

AGACACGGTG CGGTACGACC TCGTACTACT TTCGCCGACG GCCTCGTCCG TTGTCATCCA 12 0 

CGAACTGATC GGACATGGGT GCGAACACTT CAGAGAAAAA ATCGTTGGAC TGCGTGTCGG 180 

GCCTGAGGAA CTACGGGTGG TGGCTTTTCC GAAGAACGGC TCCGGGTTTG ATGACGAGGG 24 0 

TACACCCTCC GAAGAGATTG TACTTGTGGA GAACGGCATT GTGAGGCACG CTGTCAGGGA 3 00 

TCGGGCGACT GGAGGAATGG CGCCTTTTTC CGGTTTGACC AAAGTGGCAT CACATGGTGT 3 60 

CAAACCTGGC TCAAGATGTA CGCATCTCAA GGCGGAAGGG GAATCGTCAC AGGAAGGAGT 420 

TACCGGAGTA CCCGCCGAAC GCACCGTTTG GATAGAGCAT TTTTCTGCAG CGAACTACCA 4 80 

TTCAGGTCGA GCCTTTTTCA GGTCTGGCCT TGCCTGGGTA GGCAGCCGAG AAGAACTCTT 54 0 
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ATATCCCTTA ATGCCTTTCA CCATGTCAAT TGATATCTAC GAACTGGCCA GCTTATTGTG 600 
GCATTTAGAC GGTCAAACGG AACGAGCACG TAGGGTACTG TGC 643 
(2) INFORMATION FOR SEQ ID NO: 323: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 7 80 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY : linear 

(xi) SEQUENCE DESCRIPTION : SEQ ID NO: 323: 

GGTACCCACT CATTCTTGAT GAATTGTGAA CAGTTGCCCT TGGGTCGTTT TGCGAGTTGA 60 

AGTCAAGAAG AGGAAAAAAA CAAAAAGGAG AAATACTCAT GGCAGTAATT TCAATGAAAC 120 

AACTTCTTGA GGCTGGTGTA CACTTTGGTC ACCAAACTCG TCGCTGGAAT CCTAAGATGG 180 

CTAAGTACAT CTTTACTGAA CGTAACGGAA TCCACGTTAT CGACTTGCAA CAAACTGTAA 240 

AATACGCTGA CCAAGCATAC GACTTCATGC GTGATGCAGC AGCTAACGAT GCAGTTGTAT 300 

TGTTCGTTGG TACTAAGAAA CAAGCAGCTG ATGCAGTTGC TGAAGAAGCA GTACGTTCAG 3 60 

GTCAATACTT CATCAACCAC CGTTGGTTGG GTGGAACTCT TACAAACTGG GGAACAATCC 420 

AAAAACGTAT CGCTCGTTTG AAAGAAATTA AACGTATGGA AGAAGATGGA ACTTTCGAAG 4 80 

TTCTTCCTAA GAAAGAAGTT GCACTTCTTA ACAAACAACG TGCGCGTCTT GAAAAATTCT 540 

TGGGCGGTAT CGAAGATATG CCTCGTATCC CAGATGTGAT GTACGTAtTG ACCCACATAA 600 

AGAGCAAATC GCTGTTAAAG AAGCTAAAAA ATTGGGAATC CCAGTTGTAG CGATGGTTGA 660 

CACCAATACT GATCCAGATG ATATCGATGT AATCATCCCA GCTAACGATG ACGCTATCCG 72 0 

TGCTGTTAAA TTGATCACAG CTAAATTGGC TGACGCTATT ATCGAAGGAC GTCAAGGTGT 7 80 
(2) INFORMATION FOR SEQ ID NO: 324: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 624 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 324: 
CGGGAAAAAT CAGATTGTGG GTTCAGATAT CGAATTAGCC AAGGCTATCG CAACAAAACT 60 
AGGTGTCGAA TTGGAACTAT CTCCCATGAG TTTTGATAAT GTACTGGCTA GTGTTCAATC 120 



WO 98/18931 



PCT/US97/19588 



1350 

AGGAAAAGCC GACCTTGCCA TATCAGGTGT TTCTAAGACA GATGAACGGA GCAAGGTGTT 180 

TGACTTTTCC ATTCCCTACT ATACTGCAAA AAATAAACTC ATTGTCAAAA AATCTGACTT 240 

GACTACTTAT CAGTCTGTAA ACGACTTGGC GCAGAAAAAG GTTGGAGCGC AGAAAGGTTC 300 

GATTCAAGAG ACGATGGCGA AAGATTTGCT ACAAAATTCT TCCCTCGTAT CTCTGCCTAA 360 

AAATGGGAAT TTAATCACAG ATTTAAAATC AGGACAAGTG GATGCCGTTA TCTTTGAAGA 420 

ACCTGTTTCC AAGGGATTTG TGGAAAATAA TCCTGATTTA GCAATCGCAG ACCTCAATTT 4 80 

TGAAAAAGAG CAAGATGATT CCTACGCGGT AGCCATgAAA AAAGATAGCA AGAAATTGAA 540 

AGAGGCAGTT CGATAAAACC ATTCAAAAGT TGAAGGAGTC TGGGGAATTA GACAAACTCA 600 

TTGAGGAAGC CTTATAAGCA TCCA 624 
(2) INFORMATION FOR SEQ ID NO: 325: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 1237 base pairs 

(B) TYPE : nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY : linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 325: 

TCTTATGAAG CCGAAGCGTG ATTTATGGCG GATAGGTTTG GTCTGCAGAA AGTGACAAAT 6 0 

CTAGTGCCAT CAGCGTATAT GGAATCTnTG GCTGAGAAAC AGTCCCGGGG TGAACTGACT 120 

TATGAGCAGG TTTATGAGGA TGCAACGGCT TATCATCATA CCATTGATGC GAGTACAGAG 180 

GAGGCAGACT TGGTTTCTCT ACGTATTGTA GAACTATTGT CTCGAAGAGG CTTTAGCTTC 240 

AGTCCTGCGA TCTTACTTGC TATTCATAAG GAGTTGTTTC AAGATATATT TGAACCCTCG 3 00 

ATTCCGGTAG GTCAATTTCG TCAGACTAAT ATCACAAAGA ATGAACCTGT TTTGAATGGT 3 60 

GAAAGTGTTG TGTACTCTGA TTACTCCATG ATTCAAATGA CCTTGGATTA TGATTTTAAT 420 

CAGGAAAAAC AAGTTGCATA TGCGACACTA ACCCAGGCGG ATATGGTTAA AAAAATCCAG 4 80 

CATTTTATTT CAGGAATCTG GCAGATTCAT CCATTTCGCG AAGGAAACAC TCGGACGGTA 54 0 

ACGGTATTTT TGATTCAGTA TCTTCGTGAG TTTGGTTTTG ATATTGATAA TACACCATTT 600 

CAGCAACATT CCAAGTATTT TCGTGATGCC TTAGTGTTAG ATAATGCAAA GATTTTACAG 6 60 

CGACGTCCTG AGTTTTTAAC AGCTTTTTTT GAAAATCTCT TGCTCGGTGG TCAAAATGAT 720 

TTGTCTTCAG AAAAAATGTA TCTAGATTTA GACCTCGATC TTTCATAATC CTAATACTGA 780 

GTAAACATTG AATTTTAGGA AAAAATGAAG TAAATATTCT CACAAGAAAA CGTATATCAT 840 

CAAAGTTTGG CTCTTTGTCA ATTGTAGTGG GTTGAAGAAA AGCTAAGTTC GAGAAAGGGC 900 
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AAATTTCGGC CTTTCCTTTT TGATGTTCAG AGCGATAAAA ATCCGGTTTT TTGAAGTTTT 960 

CAAAGTTTCG AAAACCAAAG GCATTGCGCT TGATAAGTTT GATGAGATTA TTGGGCGCTT 1020 

CCAGTTTGGC ATTAGAATAG TGTAGTTGAA GGGCGTTGAT AACCTTTTCT TTATCTTTGA 1080 

GGAAGGGTTT AAAGACAGTC TGAAAAATAG GATGAACCTG CTTAAGATTG TCCTCGATAA 1140 

GTTCGAAAAA TTTCTCCGGG TCCTTATTCT GAAAGTGAAA CAGCAAGAGT TTGAAGAGCC 1200 

GATAGTGATG TATCAAGTCT TGTGAATAGC TCAAAAG 1237 
(2) INFORMATION FOR SEQ ID NO: 326: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 461 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 
<D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 326: 

TTTGATTTTT CTGAATTAGA AGAGATTGAA TTGCCTGCAT CTCTAGAATA TATTGGAACA 60 

AGTGCATTTT CTTTTAGTCA AAAATTGAAA AAGCTAACCT TTTCCTCAAG TTCAAAATTA 120 

GAATTAATAT CACATGAGGC TTTTGCTAAT TTATCAAATT TAGAGAAACT AACATTACCA 180 

AAATCGGTTA AAACATTAGG AAGTAATCTA TTTAGACTCA CTACTAGCTT AAAACATGTT 2 40 

GATGTTGAAG AAGGAAATGA ATCGTTTGCC TCAGTTGATG GTGTTTTGTT TTCAAAAGAT 3 00 

AAAACCCAAT TAATTTATTA TCCAAGTCAA AAAAATGACG AAAGTTATAA AACGCCTAAG 3 60 

GAGACAAAAG AACTTGCATC ATATTCGTTT AATAAAAATT CTTACTTGAA AAAACTCGAA 420 

TTGAATGAAG GTTTAGAAAA AATCGGTACT TTTGCATTTG C 4 61 

(2) INFORMATION FOR SEQ ID NO: 327: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1436 base pairs 

(D) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 327: 

TAACATTTAG GTACCTCTTC TTAACAAAGT TCAATAGTAA CAATTAATAT TTTAAACAAT 60 

ATATCAAACA TCAATGACTA GAATACTTGC ATCATCCTTC TTTCCATAGA TTGGATCAAT 12 0 

AGCAGAAGAA TTAAATCTCA TCTTAATTAA CTCTTCAAAA GTTTTATTTT GATTATTTTG 180 
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ATTAATAATA GCATGGTCTA AAAATCTCTC ATCCAACGAA CCTATCCAGT ACCCACTCGG 300 

TTGATTAGAT AATTTTCTGA TTTTTTGTAA AATAATTTTT TTATTTAAAA CACTATTTGT 360 

ACCAATTGAA TCTTTTATCT CATTTTTCCC TTTTTCAAAT AAGTTATCTA CTCTATGATC 420 

AGTTATTTCC ATTTCGTTTA CTAACATGAC GCAGTCACCT AGCATCATAT ACTCCAACTT 480 

TTTTTCTGAA AGTTTAGCAA ATATTGGTAA GCGATAATAT AGTATATTGA AACTAGAATA 540 

GTACACCTCT ACTTCTAAAA CATTGTTAGA AATCGATTTG ACTGTCCTGA TTGATTTGTC 600 

CTATTATTAT TTCATTTTAC TATACTCTGT TAATTTATAT GAGTTTAAAC CGATTTCATC 660 

TTTAACCTCG AGTAAAGCAG TTTCAAATAT TTGTTTAAGA GTTTTTGATT CTTTACAATT 720 

AACCGACAAA CTTTCTGATA AAATATGTAC AACTTCTGAG ACTGAATAAC CTATCTCCTC 780 

TTTAGAATTA TATAAATCTG TAGCTCCACC AATAATCCAA AAATACTGAT TTTGTGAACC 840 

TACAATATCC TCATTTTCTA CGGAACTTCC TTGTATCGAA CAAATTTTAT TTATCTTTAC 900 

CATAATACTT CAACCCTTTT AGTGTCAAAA GTAAACCAAT TCCTGTCACT GTTAAGAATA 9 60 

GTTCCATAAT CTTATTCGAA CCAGTCTTTG GTAATTTTTG TTTkACATCT ACTATyTCTT 102 0 

TAGATTTATT AATATGATTT TCAGTTTCTC TGCCATCTCC AACTATTTTA TAGTTTACTT 1080 

CTTCTGTCTT ATTATCTTGT TTATTGTCGA TCTTGTCATT CATTTGTCTA TTATCTTTAC 1140 

TTGAGTTAAA CTCTCCGTTC TTCTGGTTAC TATCAATTAC ATTATTTGAA TTAGATTGTT 1200 

TTTCCTCTTT GTTTTTTTCT TTTTCGTTTT TATCACTTAA ATTATTTGTT ACAATTTTGT 12 60 

AAAGCCCATT CTCCGTTACA ATATTGAAAT TACCATCGCT ATCACGTATA ACAGGTTCTT 1320 

TCCCATTTGC ATTAGATTTG ATGAATGATA TATACTTACC GGATAAATTA TAAAATTGGT 13 80 

TATTTAAAAC GGTTATTTTA CCCTTTGAAT CCTCAATAAC AATTCCTTCT TTACCC 14 36 
(2) INFORMATION FOR SEQ ID NO : 32 8: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 646 base pairs 

(B) TYPE : nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 328: 

CCGGCAGACA GGAGAAGGTG TTAAATATCA ATCTCAAATG GTTCGTCAAT GGTTTCTGAT 60 

ACGTATTTTC CGTCTTTCTT CCGTTGCTTG ACACACTCTG TGAGGAGATA TTCGATTTGC 12 0 

CCATTGACTG AACGAAAGTC GTCTTCTGCC CATGATGCGA GTGCAGCGTA TAACTTTGTT 180 
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GAGAGTCGAA GGGGGATCTG CTTTTTTTTA GCTTCAGCCA TCTTTAGTAA AGGCTTCCTG 24 0 

TGTTGACAAT TGGTTGTGCA TCATGATTGC CACAAAGAAC GACAAGGAGA TTTGAAACCA 300 

TGGCAGCTTT TCGTTCTTCG TCAAGTTCTA CCAATTCCCC TTCATTGAGC CGTTCTAGTG 360 

CCATTTCAAC CATTCCTACA GCACCATCTA CAATCATCTT CCGTGCATCA ATAATGGCAG 42 0 

ATGCT'TGTTG GCGTTGAAGC ATAACGGCAG CAATTTCTGG AGCATAAGCT AGGTAAGTGA 480 

TACGTGCTTC AAGGATTTCC AAGCCAGCAT CCTCAACACG ACTTTGGATT TCTTCACGAA 54 0 

TACGGGTAGC AACAATTTCG CTAGAGCCAC GGAGACTACC TTCATCTGCG TGCCCATCAC 60 0 

CCGGAGTATC CACATTAGGA GACACATCGT AAGGATAGAT GCGGAC 64 6 
(2) INFORMATION FOR SEQ ID NO: 329: 

<i> SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 1653 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY : linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 329: 

GTTGCAGGTG CAGTAGGTGT TACTTCAGAT ACATTTGAAC GTGCAGAGGC TCTTTTTGAG 60 

GCAGGAGCGG ATGCGATTGT TATTGATACT GCACATGGTC ATTCTGCAGG TGTCTTGCGT 120 

AAAATTGCCG AGATTCGTGC TCATTTCCCA GATCGGACTT TGATTGCTGG AAATATTGCT 180 

ACTGCTGAAG GTGCACGTGC CCTTTATGAA GCGGGTGTAG ACGTTGTTAA GGTTGGTATT 240 

GGACCAGGTT CTATCTGTAC TACTCGTGTG ATTGCTGGTG TTGGTGTTCC GCAAGTAACA 300 

GCTATCTACG ATGCTGCAGC TGTTGCGCGC GAATATGGTA AAACGATTAT TGCTGACGGT 3 60 

GGGATCAAGT ATTCTGGAGA TATTGTAAAA GCACTTGCTG CAGGTGGAAA TGCTGTTATG 420 

CTTGGATCTA TGTTTGCTGG AACTGATGAA GCTCCAGGCG AAACTGAAAT CTTCCAAGGA 4 80 

CGTAAATTCA AGACTTACCG TGGTATGGGA TCAATTGCTG CTATGAAGAA AGGTTCAAGC 540 

GACCGTTATT TCCAAGGTTC TGTCAATGAA GCAAACAAGC TTGTTCCAGA AGGAATTGAA 6 00 

GGTCGTGTTG CTTATAAAGG AGCGGCAGCT GATATTGTTT TCCAAATGAT TGGTGGTATT 660 

CGCTCTGGTA TGGGTTACTG TGGTGCAGCT AACCTTAAAG AACTACACGA TAATGCTCAA 720 

TTTATTGAAA TGTCTGGTGC TGGTTTGAAA GAAAGCCATC CTCATGATGT GCAAATTACT 7 80 

AATGAGGCAC CAAATTATTC TATGTAAAAA ACAATGAAAA GAACTCCAGT GAAAACAGGA 840 

GTTCTTTTAC AATGTTGTCA ATTTCCATTT ACAGCAGCTT TACCATCCTG AATAGTGAAG 900 
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ATACTTAGAT TTTCTGGCAG ATTTTGAAGA TGGTCTAAGC TTGTTGTTGT GATAAAGGTT 9 60 

TGGATTGATT GAGAAATCGT TTCTAATAAT TTTAACTGTC TAGTGTTGTC AAGTTCACTC 1020 

ATCACATCGT CAAGCAGTAA TATAGGAGAT TCTGTGGTAA TGCTTTCCAT TAATTCGATT 1080 

TCTGCTAATT TTATCGAGAG GACGAGACTA CGATGTTGAC CTTGGCTTCC GAAACTAGCA 114 0 

TCCATCCCAT TTATATAAAA AGAAATGTCA TCTCGATGAG GACCGACACC AGTATTCTTT 1200 

TTAAATAAAT CTCTGGATCT ACTTTTTTCT AAAGCAATTT TGAAAGATTC GGATAAGTTT 12 60 

TGTTTGTCAG TTATATTGAC AGAAGATTGA TAGGATATTG ACAACTCTTC GATCTGATTA 132 0 

GAGAGTTCAA AATGTTTCTT ACGCCCAAAT GATTCTAGTT TTTTTATGAA ATCTAAGCGG 1380 

TGATTCATTA CACGACATCC ATAATCAACT AGCTGATCAT CTAACACAGA AAGGAATGTT 144 0 

TCATCTATTT TTTGAGCTGA TTTTAGGTAA GTGTTTCTTT GCTTTAGGAT GTGGTTATAA 1500 

TTGGTTAAGT CAGATAAATA GATTGGCTTA ATTTGCCCAA GTTCCATATC AATGAATTTT 15 60 

CGTCGAATCG AAGGTGCTCC TTTAATTAGT TGTAAATCTT CAGGAGCAAA TAAGACAACA 162 0 

TTCATGTGTC CTACATAATC TGAAAGGCGT GCC 1653 
(2) INFORMATION FOR SEQ ID NO: 330: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 1340 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

( D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 330: 

GAAACACTGT ATTTCAAAGC ATTTTTTGTT AGTTTAAAAT TACTCCCATT CTTCTTTTCC 6 0 

AAACGTACAA TATATCCAAA ACCATTCAAA AT AC TAG ATT CTATTTTTTA TAATATCACT 12 0 

AAATCCACCT AATTATAGGA CGTTTTCAGA TTTTTAGTCC CAGTCCCAGT ACCGGAGAAA 180 

TATTGTTTTA ATATAATATC TCTTTTTGTC TTCTAAGCTC TTAAAAGCAA AAGAACAAGT 24 0 

AAAGAGTCAA GACAAGGATA AAAAGTCCAT ATTAGGGCAA ATAAAAAGCT TTAAGACAGA 3 00 

TGACAAATCT AAGTCAAATA AGAAAGACCA TAGCAAAGGT GCAGAGAGAT AAATATTGGC 3 60 

GGTCTTCGGA CTGCCTTTAT TTTTTTATCC ATTTTTCAAA TCAAATTTAT TCAGACTATA 42 0 

TATGCACATA TACACTTAAA TTCATATAAA AACATGGCTT GTAAAAAATT ACTTTAATCA 480 

CAATAATCGC ATTTAAAATT GTGATGTTTG CAAGCTAAAT TACGGACTTC ACTTGGAAGT 54 0 

TTTCCCTTGT ATCTTTTATA ATAGATAGAA AATTTGCTGG CAGATGAATA TCCAACAGAT 600 

TCTGCTATCT CTTTTATAGG TAGTTCAGTG TTTAAAAGAA GAGTTTCAGC TACATTCATT 660 
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CTTTTTCTTT GAGTGTACTC TGTAATGCTT TGACAATATT TTTCCTTAAA TAAATTTTTT 

AATTTAGTAC CACTCATTTT AGATATTTTT TCAAGCGTGC CTTGATTTAC ATTCGTTGCA 

AAATGATCAT CTAAGAATCT TGCTACATCT TCAAGTGCTT TATCATCATC AATTTCAATT 

TTATATTTTT TTCTATTTAA GTATGTGTCA ATTACTATAC TTATCCATTC ATTTGCCTTT 

GCTTTAAAGA AAAAATCAGC GGCAGGAGCG TCCATCTTAC AATTTAATAT TTCCATTGCC 

ACTCTTTCTA AGGCCTTTGT AAGTATTATT TGATTCGGTT GAAGCAAGGT TGAATAAAAA 

GATTCTGGAT TAATGTTAAT AGATGCTAAA TGTTTTTCTA TTAGCTCTTT TTTAAAACCm 

ATGGAAACAG CAAGATAACA ACAATTCTCG TGTAATAAAA AAACAAAATT ATCTTTTATA 

TTATCAAAAT CAAAAGTACA TAGAGAGTTT GCGGTAATAG TTTGATACGG ATTAAACTTT 

TCTCCGTTTG CACTGACAAT GTAACTTGAA TAAATTGAAA CATAGTCTGA CATACTATAA 

GTGCTATTTT GAACTACTTC CTCTTTGATA TAAAAATCAT GTATATCGAT AATGAAGATG 

CCTCCTTCAT AAAACCGGTA 

(2) INFORMATION FOR SEQ ID NO: 331: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 607 base pairs 
(D) TYPE: nucleic acid 
(C) STRANDEDNESS : double 
(D! TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 331: 
TATGTTCGTG ATGAGTTTTT AAGTAGGAAA AACGTGCTAA CCTCTCAGAT TTTGGAACTT 60 

GTAAAAGAAA CTCTTTTTTC ACCCGTAGTA GTTGATAATG GGTTTGATCC GGCCTTATTT 120 

GAAATTGAGA AAAAACAATT GCTAGCAAGT TTAGCAGCTG ATATGGATGA TTCTTTTTAT 180 

TTTGCACATA AAGAATTGGA TAAATTGTTT TTTCATCATG AACGTCTTCA ATTGGAATAT 240 

AGTGATTTAC GAAATCGTAT TTTAGCTGAA ACTCCACAAA GTTCTTATTC TTGTTTCCAA 300 

GAATTTTTAG CCAATGATCG AATAGATTTC TTTTTCCTAG GTGATTTTAA TGAGGTTGAA 3 60 

ATTCAAAATG TATTAGAATC ATTTGGCTTT AAAGGTCGAA AAGGAGATGT GAAGGTTCAG 420 

TATTGTCAAC CTTATTCTAA TATCCTTCAG GAAGGTATGG TTCGGAAAAA TGTGGGACAA 480 

TCCATTTTGG AATTAGGTTA TCATTACTGT TCTAAATATG GTGATGAGCA ACATTTACCC 540 

ATGGATTGAA TGAATGGTTT ACTTGGTGGA TTTGCTCACT CTAAGCTCTT TACAAATGTC 600 

CGGGAAA 607 



WO 98/18931 



PCT/US97/19588 



1356 

(2) INFORMATION FOR SEQ ID NO: 332: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 900 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY : linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 332: 

TTAAAATACC GAATTTTGTT TTGTCCTCTA TTTCAACATT GTGAATCGCC TCAGGCAGAG 60 

AACCGATACT AAAGATATAA CCAAAATAGT TGTCATTTGC TTTACCGATA TCAATCTTAT 120 

TGGTTAAATC AAAATCCAGT TCGTCAATTG CGCCATCGAT GTCTTGATTG ATTTCCAAAA 180 

GTTTTGTAAT GAGGTTACCC GTACCGCCTG GGATAATCCC TAACTTAGGA ATGTAGTCTC 240 

TCTCATCAAT ACCTGAAATG ACTTCATTGA CAGTTCCATC TCCACCAAAC ACAACCACTG 300 

CATCATACTG CTCACGAGAA GCTTCTTCAG CAAAATGTGT TGCATCCAGC GCTTTTTCGG 360 

TAATTTTGGT TTCAACATAT TCAAAGTATT CTTTTGCTTT ATTCTCCAGC TTTTCTTTGT 420 

AATCCAAAGC CTTCTCGCCA CCAGAAGTAG GGTTGATAAT TACCATTGCT TTTTTCATTG 480 

ATTTTATCCT TAATTTTAAA CAGAAATGTT TACATTTCGT CGTATGCAAG TAAATGTAAT 540 

CCTATTATAC AATGAAAATA CAGAAAAGAG AAATCTGACG TACTGGAGAT TAATACGCTT 500 

TTATTCTATT TTCCCATCGC CTAACTACAT CCTTTAAGGG TTCATCCAAG TAAGAATAGG 660 

CCTTATCCTT GATCCAATCA GGAATACCGT AAGCTGCCTC TGCTAwGCTA CAAGTGATTG 720 

CTGCGAGAGT ATCACTGTCG CCACCAAGTG AGATGGCATT TCTTATCGCA TCTTCGAAGT 780 

CTCTACTTTC AAGAAAGGCG ATAATGGCTT GAGGGACAGT TTCCTGACAT GTTTCGTTAA 840 

AACGATAGTT AGGACGGATT TCATCTAAAG TTTGAGATAG ATTGTAATCG TATTCTTTTT 900 
(2) INFORMATION FOR SEQ ID NO: 3 33: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 533 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY : linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 333: 

CCTTTCTGGC ACACTGGTCT TGGAATACGG CAAAACCTCT GAAAATATCT ATGCTGGAAT 60 

GGACGAGGAA TACCGTCGTT ATCAGCCTGC CATCATCACT TGGTACGAAA CAGCCAAACA 120 

TGCTTTTGAT CGCGGACAGA TTGGCAAAAT ATGGGTGGAA TCGAAAACGA CCTCAAGGGC 180 
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GGTCTCTACA GCTTTAAATC CAAGTTCAAT CCGACCATTG AGGAATTCGC TGGTGAGTTC 240 

AACCTGCCAA CTAATCCTCT TTACCACCTC TCCAATCTGG CCTACACTCT CAGAAAGAAA 300 

CTGCGCAGaA GcATTAACAG AAAGGAAGCC TATGACCTTT AAACTTCTCA GCCAAGAAGA 360 

ATTCATCCAG CATACCTCAG CTAGATCCCA ACGCTCTTTT ATGCAGACCG TAGAAATGGC 420 

AGAGCTGCTG AGCAAGCGTG GCTTCAGTAC CCAGTATGTC GGCTACACTG ACCCACAAGG 480 

GAAGGTAGTG GTGTCAGCTG TCCTCTACAG CATGCCTATG ACTGGTGGCC TTC 533 
(2) INFORMATION FOR SEQ ID WO: 334: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 544 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY : linear 



(xi) SEQUENCE DESCRIPTION : SEQ ID NO: 334: 

CCAGCAAACT AGGAAGCTAG CCGTAGTTGC TCAAAGCACA GCTTTGAGGT TGTAGATAAG 60 

ACTGACGAAG TCATGTACAA AACACTGTTT TGAGGTTGCA GATAGAACTG ACGAAGTCAC 120 

TCAAAACACT GTTTTGAGGT TGCAGATAGA ACTGACGAAG TCACTCAAAA CACTGTTTTG 180 

AGGTTGCAGA TAGAACTGAC GAAGTCAnnA ACCACACCTA CGGCAAAGTG AATCTGAAGT 2 40 

GGTTTGAAGA GAGTACAACT TGTCTTTTAG AAAAGGAGCC TATAATGAAA GTCTTTCAGC 300 

ATGTAAATAT CGTGACTTGT GATCAAGATT TCCATGTTTA TCTTGATGGA ATCTTAGCAG 3 60 

TCAAGGATTC TCAAATCGTC TATGTCGGTC AAGATAAGCC AGCGTTTTTA GAGCAAGCTG 420 

AGCAGATTAT AGACTATCAG GGAGCTTGGA TTATGCCTGG TTTGGTCAAT TGTCACACCC 4 80 

ATTCTGCAAT GACAGGTCTG AGAGGGATCC GAGATGACAG CAATCTCCAT GAATGGCTCA 54 0 

ATGA 544 
(2) INFORMATION FOR SEQ ID NO: 335: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 349 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 335: 
CCAGGAACTC AAATGTAAGT AGGGGTTCCT TTTTTGTATA TTTTTCAAAT AACGCCTCTA 60 
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CACTATTTGT AGCAAATTCA CCAACTACAG TTGTATCTTA GTTAAAATAA GTTAGAATAT 12 0 

GTAAGTGAGT ACCAGATATA CCAAGACATC GTCACCATCT AAGGTATATT CAAAATACAA 180 

AAGTTGACCA ACTAGATTTC TGAATATCCT TATATATCCA TTCTTAAAAT TGGTTTAAAT 240 

AGCGTAGTCT TTTAAACTAG TTTTGAGAAT CCAAAAAATC TTCCTACATA TGTAAGAAGA 300 

TTTTTTAGTT CAGAATGATT AGaTTTAGCT AATGGATACC TATCCTACC 34 9 
(2) INFORMATION FOR SEQ ID NO: 336: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 1206 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 336: 

CTCCGATAAC CACACCAGCA ATGGAAATAA TTCCATCGTT AGCATCAAGA ACACCCGCAC 60 

GCAGGATATT TAAACGACCT GCAAAATTTG AATCAATTTC GTGATTTGTT TCTGACGCTA 12 0 

AATTTCAAGT TCAAGTTAGC CATCAAGAAG TCTTCTCTGG GTGACTTGTA GTCCAAGCAT 180 

TTTTTAGGAT AGTTGTTAAT CCACTTTTCG ATGAATGCGA CTTCTTTGGG AGTCATTTTC 240 

TTGGTTCCCT TAGGTAACCA TCTACGAATG AGCCTGTTGT GATTCTCATT AGTTCCCGGG 300 

ATCCTCTAGA GTCGACCTGC AGGCATGCAA GCTTGGCACT GGCCGTCGTT TTACAACGTC 3 60 

GATGACTGGG GAAAACCCTG GCGTTACCCA ACTTAATCGC CTTGCAGCAC ATCCCCCTTT 420 

CGCCAGCTGG CGTAATAGCG AAGAGGCCCG CACCGATCGC CCTTCCCAAC AGTTGCGCAG 480 

CCTGAATGGC GAATGGGGCC TGATGCGGTA TTTTCTCCTT ACGCATCTGT GCGGTATTTC 540 

ACACCGCATA TGGTGCACTC TCAGTACAAT CTGCTCTGAT GCCGCATAGT TAAGCCAGCC 600 

CCGACACCCG CCAACACCCG CTGACGCGCC CTGACGGGCT TGTCTGCTCC CGGCATCCGC 660 

TTACAGACAA GCTGTGACCG TCTCCGGGAG CTGCATGTGT CAGAAGTTTT CACCGTCA'J'C 72 0 

ACCGAAACGC GCGAAACGAA AGGGCCTCGT GATACGCCTA TTTTTATAGG TTAATGTCAT 780 

GATAAGGATG GTTTCTTAGA CGTCAAGTGG CACTTATCGG GGAAATGTGC GCCGAGACCC 840 

TATTTGTTTA TTTGTCTAAA TACATTCAAA TATGTATCCG CTCGTGAGAA AATAAACCTG 900 

ATAAATGCGT CAATAATATT GAAAAATGAA GAGTATGAGT ATTCTACATT TCCGTGTCGC 960 

CCTTATACCC TTTTTTGCGG CATGTTGCCT TCCTGTTTTT GCTCACCCAG AAAACGCTGG 102 0 

TGAAAGTTTA AGATGCTGAA AAATCATTTG GGTGCACAAC TGGGGTTACA TCCAACTGGA 1080 

ATCTCCAnCA GCAGTTAAGA TCCTCTGACA GTTGTACACG CCGCAAGAAC TATTCCCGAT 1140 



WO 98/18931 



PCT/US97/19588 



1359 

GAATGAGCAA CTTTTAAAAG TCCTGCGAAT GTTGGGGCGG TAATAATCCC CGTGTTGTAG 1200 
GCCCGG 1206 
(2) INFORMATION FOR SEQ ID NO: 337: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 813 base pairs 

(B) TYPE : nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 337: 

CTGCTCAACT CAGACAGTCA AATTTCTGAC TTTACCAAAA GAACCATCAA AAAAGTTGCT 60 

GAAAAAGGCC ATCAGGTTAT TATTACGACA GGTCGCCCTT ACCGTATGTC AAAAGATTTT 120 

TACCGTGAAC TGGGCTTAGA CACTCCTATG ATTAACTTCA ACGGATCCCT TACTCATTTA 180 

CCAGACCAAG TTTGGGATTT TGAAAAGTGT TTGACTGTAG ACAAAAAATA TCTGCTAGAT 240 

ATGGTTCAAC GTTCAGAGGA CATTCAAGCC GATTTTATCG CTGGAGAATA TCGTAAAAAA 300 

TTCTACATTA CAAATCCCAA TGAAGAAATT GCCAATCCCA AACTATTTGG TGTAGAAGCT 3 60 

TTCCAGCCTG AAGATCAATT CCAGCCTGAA TTGGTGACCA AGGACCCTAA CTGTATCCTC 420 

TTGCAGACTA GAGCCAGTGA CAAATATTCC TTGGCAAAAG AAATGAACGC CTTCTACCAG 4 80 

CATCAACTTT CTATCAATAC CTGGGGAGGT CCGCTCAATA TCCTTGAATG TACCCCAAAA 540 

GGTGTCAACA AGGCCTTTGC TTTGGACTAC TTGCTCAAGA TAATGAATCG TGACAAAAAA 600 

GATTTGATTG CCTTTGGAGA TGAACACAAT GATACCGAAA TGCTCGCTTT TGCTGGGAAG 660 

GGTTATGCCA TGAAAAATGC CAATCCAGAG CTACTCCCTT ATGCAGATGA GCAAATTTCC 720 

CTTACCAACG ACCAAGATGG GGTTGCCAAA ACCCTACAAG ACTTATTCTT ATAACCTATA 7 80 

CTGATACTCA ATGAGGGGCA AAGAGCGAAC TTA 813 
(2) INFORMATION FOR SEQ ID NO: 338: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 683 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY : linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 338: 
CCTAGATAAA TGATATAATT CTATTATTGT TCGTAAAAAT TAAAAGGAGA TTGATGATGG 60 
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ACAAATTATT TAAACTAAAA GAGAACGGTA CAGACGTTCG TACAGAGGTT CTCGCTGGTT 12 0 

TAACAACTTT CTTTGCAATG AGCTATATTC TCTTTGTAAA CCCACAAATA CTTTCACAAA 180 

CAGGAATGCC TGCTCAGGGC GTCTTCCTAG CGACGATTAT TGGTGCAGTA GCGGGTACCT 24 0 

TGATGATGGC TTTTTATGCT AACTTACCTT ATGCCCAAGC GCCAGGTATG GGACTCAATG 300 

CCTTCTTTAC CTTTACAGTT GTATTCGGGC TTGGTTATTC TTGGCAAGAA GCCCTAGCTA 360 

TGGTCTTCAT CTGTGGGATT ATTTCATTGA TTATTACCTT GACAAATGTT CGTAAAATGA 420 

TCATTGAATC GATTCCCAAT GCTCTTCGCT CAGCTATTTC AGCTGGTATC GGTGTCTTCC 480 

TTGCCTATGT AGGGATTAAG AATGCTGGAC TTTTGAAATT CACGATTGAT CCAGGCAACT 540 

ATACTGTTGT AGGAGAAGGG GCTGACAAAG CTCAAGCAAC GATTGCAGCA AACTCTTCAG 600 

CAGTTCCAGG ATTGGTCAGC TTTAATAATC CAGCTGTTTT AGTGGCTCTT GCAGGACTTG 660 

CCATTACTAT CTTCTTTGTC ATC 683 
(2) INFORMATION FOR SEQ ID NO: 339: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 852 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY : linear 

(xi) SEQUENCE DESCRIPTION : SEQ ID NO: 339: 

CTACTTTACA TGGAAGTAGT CACTGAATTC CAGTTAGAAA TTACTTTGTA ACTACGTTTT 60 

GAGGAGGAGT AAAATGCTTT CCTACGTTCG ATATTACCCA CTAGCGATAG CTAAATTAAT 120 

GTGTCTGTGC TCTCCTAAAA TCTGCTGATT TATTACTGAC TAATACAGGA GGTTTTTTTT 180 

ATGgACAGAC AATCATATCT GCTATTGGTG TTTATATTTC CACCAGTATC GATTATTTAA 2 40 

TTATTTTAAT TATTTTATTT GCACAGCTAT CACAGAATAA ACAGAAATGG CATATTTATG 3 00 

CGGGGCAATA TCTAGGCACA GGCTTACTTG TAGGGGCGAG TTTAGTTGCT GCTTATGTCG 3 60 

TTAATTTCGT GCCTGAAGAA TGGATGGTTG GATTGCTTGG TTTAATCCCT ATCTATTTAG 4 20 

GGATTCGCTT TGCAATTGTT GGAGAAGATG CGGAAGAAGA AGAGGAAGAA ATTATTGAAA 4 80 

GATTAGAACA AAGCAAGGCA AATCAACTGT TTTGGACAGT TACATTGCTG ACAATTGCGT 540 

CTGGCGGAGA TAATTTAGGT ATCTATATAC CTTATTTTGC TTCGTTAGAT TGGTCACAGA 600 

CCCTCGTGGC CTTGCTTGTG TTTGTAATCG GCATAATTAT CTTTTGCGAG ATTAGTCGGG 6 60 

TGTTATCCTC TATTCCGTTA ATATTCGAGA CAATTGAAAA ATACGAGCGA ATCATTGTGC 720 

CCTTAGTATT CATTCTACTT GGACTATACA TCATGTATGA AAATGGCACG ATAGAGACTT 7 80 
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CTGATTTTCA GA 

(2) INFORMATION FOR SEQ ID NO: 340: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 754 base pairs 

(B) TYPE : nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY : linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 340: 

CCGCACAAAA GCGCATAGTA TCAAGATTCT ATAAAGCCTT GATACTATGC CTTTTTAATG 60 

GATAAATAGT TAGTCTTTTT TAAAGACCGG ATCTTTCAAA CTCTGCATAC TGGCATTGAT 120 

CACCGCGCCT AGGATAACAA TTTTAGCAAT CAAGATAAAC CAAAACA'I'CA TAACAACAAG 180 

AAGAACGGAA CCTAAAATTC GGACATCCAC CAAATGATGG ACATAGTAAT TGAGATAACT 240 

AGAGAACAGA GTTAGTAAAC CTAAAATCAC TAAGAGAACA AAGGCACTGC CTGGTAGGGT 300 

ATAGCTAATT TTCCTGTTAG ATAGATTGGG AAGAAAATAA TAAAGCATGA CCAAGATAGC 3 60 

AAAGAGGAGG GCGTAAATCA GAGGACCTGC CAACCCTTGT AAAGCCTGAT AGATAATGCC 42 0 

ATCTTTTGTC CAATAATGAG CAAGTAAAGC CAAAATCATC TGACCAAATA AGATCAAAAA 4 80 

CAAGGCAAAC GCAAAGAGGA GCTGCAACCA AAACTGACTA GGAGACTTAG CATCTGATGG 54 0 

GAAATAAGTC CACGACTCTT TTCGACGCCA TAAGCCTTGT TAAAAGCTTT TTGCAAGAAA 600 

TTCATAGATT TTGAAAAACT CCATAACGCC GATAAAACAG AAAAACTCAA TAAACCTGTT 660 

GAAGGTTGCG TCAAGACTTC TCTGGCTATT TTTTCCACAC CTTCATAGAG GCTTGGGGGG 720 

CAGACGTCTT TCATAAAGCC CAAAAATTCT CCCA 7 54 
(2) INFORMATION FOR SEQ ID NO: 341: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 707 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 341: 
GGGGATAACT CTAGGAGTAC CGCTATTACT CGACTTAATG AGTGCACAAG AAGTCAGGAT 
TTTTATGCAG GTTGGGCGCT TCATCAGACA GGGAAGATTT ACAGCGACTA TTATGGAAGT 
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CAAGGTTTGC TTTATTATTT GCTGACTTAC GTGAGTCAGG GCGGATTTTT CTTTGCCATC 180 

TTTGAGTGGT TAGCCTTGGT AGCAGGAGGA TTTTTCCTTT TTAGATCAGC GGACACCTTG 2 40 

ACAGAGCAAG GAGACCAAGC TGGACATCTG GTGACTATTT TTTACATGCT AGTTACAGGT 300 

CTTGCTTTTG GTGGAGGCTA TGCGACTCTT TTAGCGCTTC CTTTCTTATT CGCAGCCTTT 3 60 

AGTTTAGTTG CGGCTTACCT AAGCAATCCA AGCCATGATA AGGGATTTGT ACGGATTGGG 42 0 

CTAGCTTTGG CAGGCGGATT TTTCTTTGCT CCCTTATCAT CGCTCCTGTT TATTGCTGTA 480 

GTGAGTTTAG GCTTGTTGGT CTTTAACCTT GGGCATAGAC GCTTTGCGCA TGGGTTTTAT 540 

CAGTTTCTTG CAGTGGCTTT AGGTTTTTCA CTTGTCTTTT ATCCAACTGC CTACTATAGT 600 

GCTGCAACAG GAAGTTTTGG GGATGCGwTT AGTGGTATTC GTTATCCTAT TGACAGTATT 6 60 

CGCTTTGATT TTACTTCTAA AATTTTAGAG AATATGTTTT TTTAAGG 7 07 
(2) INFORMATION FOR SEQ ID NO : 342: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH : 762 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 342: 

GGATTTTGAA AAACCATACC GATTTGACGA CGTATATTCC AAACATTTTC CTCAGTCAAA 6 0 

CGTTGGCCAT CAATTACAAT CTCTCCGGAT TCTGCTTCCA GTAAGCCATC AATTAATCGA 120 

ACCGTCGTTG ATTTACCACT ACCATTATGC CCTACAATCG AAAGCCATTC TCCACGTTTC 180 

ACGTGAAAgT AATATCCTTC ACATCGTAGT AGTTCTGATT TTCTTTATAG CGAAAAGAAA 24 0 

GATTTTTTAC ATCAATTATT GATTTCATTT CGAACCAAAT GTCCCTTTAA ATACATAGGC 300 

ACTACCCTTG AAATAGTCAT AGCCAGAGTA GATAGTGAAA AATAAGGCTA CATAAAGTAG 3 60 

AACTTGACCA AGCAAAGTCC AATGTAATAG CAAGAAAATA ATGGCAAACA TCTGACTAAA 42 0 

AGTTTTAATT TTTCCAGGCA TTGCTGCTGC TAAAATTGTT CCACCAGTTT CAACCAATAA 4 80 

AAGCCTTAAA CCTGTCACAG CTAACTCACG ACAGATAATC ACTGCAACAA TCCAAGCCGG 54 0 

AGCCATACCT AACTCAATCA ACATAATAAA AGCCGACATA ACTAGTAACT TATCCGCCAT 600 

AGGATCTGCA AATTTACCAA AATTACTGAC CACATTCCAT TTACGAGCTA AATATCCATC 660 

TAAATAGTCG GTAATACTGG CAACAGCAAA GATAATAGCT GCAACTATAT GACTCTCTAT 720 

CGAATTTCCT ATCGTTAAAA TAAAGATAAA AATAGGTATA AA 7 62 
(2) INFORMATION FOR SEQ ID NO: 343: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 482 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 343: 

CTTTTGATAC ACTTAAACTA TGAATACAAA TCTCAAGCCC AAACTTCAGC GTTTTGCTTC 60 

TGCGACTGCC TTTGCCTGTC CTATCTGTCA AGAAAATCTG ACTCTGTTAG AGACTAATTT 12 0 

CAAGTGCTGC AACCGTCATT CTTTTGACTT GGCGAAATTT GGCTATGTCA ATCTAGTCCC 180 

TCAAATCAAG CAATCTGCTA ACTACGACAA GGAAAATTTT CAAAACCGTC AACAAATCCT 24 0 

AGAAGCCGGC TTTTACCAAG CTATCTTAGA TGCTGTATCT GACTTGCTTG CAAGCTCAAA 30 0 

AACTACCACA ACAATTTTGG ATATCGGTTG TGGTGAAGGA TTCTATTL'TC GCAAACTACA 3 60 

AGAAAGTCAC TCTGAAAAAA CTTTCTATGC CTTTGACATC TCCAAACATT CAGTCCAAAT 42 0 

CGCGGCTAAA AGTGAACCCA ACTGGGCAGT CAATTGGTTC GTTGGCGACT TGGCACGACT 48 0 

TC 482 
(2) INFORMATION FOR SEQ ID NO: 344; 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH : 520 base pairs 

(B) TYPE : nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 344: 

TTTATTTTTA TAAAGTCAAT ACCTGTCTTT ACTTTTTCTT AAAAAAAGTT TATTATGTTC 60 

TTTAAGGAGG TGTAAAACAT GAAAATAAAT AATAAACTCG TTGGAGAACG TATTCAAAAT 12 0 

ATCCGTTTAA GCCATGGCGA CTCTATGGAA AAATTTGGAG AAAAATTTAA TACTAGCAAA 180 

GGTACAGTTA ACAACTGGGA AAAAGGTCGC AATTTACCAA ATAAAGAAAA CCTACTAAAA 240 

ATTGCATCTA TTGGAAAAAT GAGTGTTGAA GAGTTACTCT ACGGCGATTA CAATACTTAT 300 

CTACACTTAA AGATTATGGA TTTAGCTCCT GAATGTATAA AAAATTATGA TGAGTATAAC 360 

TCTTTACACG ATGATATAAC AAATAAAGCG TTACAGATCG CTCAAAATAC CATTTCTAAG 42 0 

ATTGATTATC AAATTTCAGA CGAAACGATC AAAAAATTTA TTGATTTAGC TATCGAACAA 480 

TCGAGAGATT TGCAAGGAAA TTTGTTGAAA AATAACGGGT 520 
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(2) INFORMATION FOR SEQ ID NO: 345: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1003 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY : linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 345: 

GCATCAAATC CGCCATCAAA GAAGTTCTCT GGATTTACCA AGACCAGTCA AATAGCTTAG 60 

AAGTGCTTAA TGACAAGTAC AATGTTCACT ACTGGAATGA CTGGGAAGTT GGAGACACGG 120 

GAACCATTGG TGAGCGCTAT GGTGCCGTTG TTAAGAAACA CGACATTATC AATAAGCTTC 180 

TCAAACAGTT GGAAACCAAT CCTTGGAACC GCCGCAATAT TATTTCGCTC TGGGATTACC 2 40 

AAGCTTTCGA AGAAACAGAT GGGCTGCTCC CGTGCGCCTT TCAGACCATG TTTGATGTTC 3 00 

GGCGTGTTGA TGGGGAAATC TATCTGGATG CGACCTTGAC CCAGCGCTCC AATGATATGC 3 60 

TGGTGGCCCA CCACATCAAC GCTATGCAGT ATGTGGCTTT GCAGATGATG ATTGCCAAAC 420 

ATTTTGGCTG GAAGGTTGGG AAGTTCTTCT ACTTCATCAA CAACCTCCAT ATCTATGATA 4 80 

ATCAATTTGA ACAAGCTCAG GAATTGCTCC GTCGGGAgCC GTCAAACTGC CAACCACGCT 54 0 

TGGTTTTAAA TGTTCCTGAT GGGACTAATT TCTTTGATAT CAAAGCAGAA GATTTTGAGT 600 

TGGTGGATTA TGACCCTGTT AAGCCACAGT TGAAGTTTGA CCTAGCTATT TAAAAGAATA 660 

GAAAAAAGAA GTTGAGAATA ATCCCAACTT CTTTTGTTTC TTAACGTGAT ACGCGGCGAC 720 

GAGCTGCTTT TTTACGGTTT TCTTCGATGA AAGCTGCTTT TTGCTCTTCT GGTTCGATTA 7 80 

CTTTCTTTTT AAATGCGTAT ACTGCACCTG CAACGGCAGC GACAGTTCCT GCGACACCTG 840 

TTACAAGACC TTTAGCGAAT CCTTTAGCCA TGAGTCTTCC TCCTTTATAT TCTCAATCAG 900 

CCAGCCTCCT CAAGAGGTCA CATTTTTCTG ACTGACCTTT TTGTGTTATA ATAATAGTAA 960 

CGAAAAAATG GGAATTTTTC AAGGAAAAAA GATGAGAACA AAA 1003 
(2) INFORMATION FOR SEQ ID NO: 346: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 750 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 346: 
CCGCACGTAC TATTCCAGAT GCCGAGGAAG TGGACCTCAT CCTCGTTGGC GCAACTGGTC 60 
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TCAACGCCTT TGAACGCCTC TTGGTCGGCT CTTCATCTGA ATACATACTC CGCCATGCTA 120 

AGGTCGATTT GCTGGTTGTG AGAGAACAAG AAAAAACCTT ATAATCACAA AGAAAAGGAG 180 

CCCCTAGCTC CTTTTTGTTT ACGATTTATT TCTCTCTTTA TGGCGTTCGT AAGCCTTGAG 240 

CTGGCGCTGC AGTTCCTTTT TAATAGCAGG TTCTGGAGCA TATTTTTCTT CCCAATTATC 300 

TGGTTTTAAG ATTTTATGGG TCACTGGATC AAAATGAGCC TTGCCATCTG GAAAAATTTT 360 

CCCCATATTG GCCTGATGGA CAATATCAAA AATACGTTCT GGGTCCACCC CCATCAAGAC 420 

AAAACTGCCG TAGGTGAAGT AAAGCGTGTC AATCAAGGCA TCCACTTGCC CTATCAAATC 480 

TTGCTGAGCA GGTGTCTTCT TGGCTACTTT ATCTGCTGCC TTATCAAGGG CCTGATGAAG 540 

TTGCGATACA GCTTGACCAA AATCTTCTTC AGAAGGACTG GCTGCTCGAA CAAACTCCAC 600 

CAATTCTTCT ATTTTAAAAC CAGCCCTATG GGTTGCACCC TCTAAATCCC AAGCTCGAGG 660 

TTCTTCTTGG GTTCGTTCAT CCATCATGTG GTGGAAAGTC TTGACCTTAT TGAAATGATA 720 

GTCACGGCTG ACAAAGACTT TTTCTGAAGA 7 50 
(2) INFORMATION FOR SEQ ID NO: 347: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 596 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY : linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 347: 
CGCAACATAC GGATAACCTC CAAAGAATAT TTTTATATTA TAGCAAAGCT TTAAATTGAA 60 

TGTTAGAGTC TTGTTCAAAA CAATCATCAA AACCACGTGG ATGATGGTAT TCTACTAAGT 120 

GTTGATCTTG AGGATAAGTG TACTTACCGC CAACTTCCCA GATAAATGGA TGGAAATCGT 180 

ATTGCAAGCG ATCTTTTCGC ATTTTCCAAA GTTCTAGAAT CTCATTAGTA GAAGCCATGA 2 40 

AGTTAGACCA GATATCATAG TGAACTGGGA TAATGACTTT GGTACGCAGA TTTTCTGCCA 3 00 

TACGAAGAAG GTCGATAGAT GTCAkTTTGT CTTGGATACC TACCGGATTT TCACCATAGT 3 60 

TATTCAAAGC AACATCAATT TTAAAGTCTT TACCATGTTT TGCAAAATAG TTTGAGAAGT 420 

GAGAATCTGC ACCATGATAG ATGGTTCCAC CTGGTGTTTC AAAGATATAG TTAACAGCCT 4 80 

TTTGAGCCAT TTCTTCATCT GTAACAGCCA AGCCAGCAgT TCACCGCCTG TCTCATCAGC 540 

ACCGTTCACT GGGAGAGTTA CCAAGCAAGT ACGGTCAAAT GATTCTACTG CATGAA 596 
(2) INFORMATION FOR SEQ ID NO: 348: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 673 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY : linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 348: 

CAGAGTCAAC AGCCTGAGTT GAAGGCAACT TTAGACACAG CAGTTACGAC AGCTGAATGA 60 

GCTCCTCCAT CAGTTTTTTC TTTAATGAGT CCAGCTACAT CTTCAACTTC GAGGCCGTTA 120 

ATCACAATGT CAGCGCCTAC TTCTTTTGCA AGGGCAAGTT TGTCATTGTT GATATCGACT 180 

GCGATAACAT GAGCATTGAA TACTTTTTTA GCGTATTGAA CAGCGAGGTT ACCAAGTCCA 240 

CCAGCACCGT AAAGAACAAC CCATTGGCCT GGTTCAACTT TTGCTTCTTT GATAGCTTTA 300 

TAGGTTGTTA CTCCAGCACA TGTGATAGAA GAAGCTTGGG CTGGATCAAG TCCGTCAGGA 3 60 

ACTTTGACAG CATAGTCAGC AGTTACGATA CATTGTTCAG CCATACCACC GTCTACTGAG 420 

TAGCCAGCAT TTTTCACTGT ACGGCAAAGG GTTTCGCGAC CAGTTGTACA GTATTCGCAA 480 

GTGCCACATC CTTCAAAGAA CCAAGCAACG CTGACGCGGT CACCGACTTT AAGGCTTTTC 540 

ACATCTGGAG CAATCTCTTT AACGATACCG ATACCTTCGT GCCCAAGAAC ACGTCCTGGG 600 

ACTTGACCAA AGTCACCATG AGCAACGTGG AGGTCGGTGT GGCAAACGCC CACAGTATTC 660 

ACTTCTACAA GTG 673 

(2) INFORMATION FOR SEQ ID NO: 349: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 198 base pairs 
<B) TYPE: nucleic acid 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 349: 
GTACCCTACA AATGCTTTAC AGTATGGGTT GAGGGTGGTC AATGGAACTA TGGAGTAGGT 
TGGACAGGAA CTTTTGGATA TTCTGATTAC TTACATTCTA CTCGATATCA TACAGCAACT 
GTTAGACATG GGGGTAGAAC CTCTAAGGAT TATGCAAAAC CTGAGGCATG GGCTAGAGCT 
TCCCTCACCA AGATTCCG 

(2) INFORMATION FOR SEQ ID NO: 350: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 891 base pairs 

(B) TYPE: nucleic acid 



120 
180 
198 
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(C) STRANDEDNESS: double 
<D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 350: 

GCTTCTTCTA TAGACAAAAA TATCATGGGT AAAATAATCA AGGCTATAGC TAGAAGGAGG 60 

GACCAATCCA CTACTAATCC TAAGAACAAA ACACTCAAGA GAGCAGAAGA GAGAGGTTCA 120 

CTGGCACTGA TAACGGCAAC CACCAAAGGA GAAACCAAGG ACACAGCCTT CATGGAAATG 180 

AAAAAAGCAA AAGCCGTTCC AAAGAAAGCG ATAATGAGGC AAATCAAGAT ACTCCAAATA 240 

TCAAGAGTAA AGGAAAGCTG ATAAACCGGC GAGAGGACAT TGCTAAACAA ACCTGCCAAA 3 00 

ATCATCCCCC ACCCAACCGT AGGAACAAAA CCATAACGCT TAGCAAAAGG TTGGGGCAAG 3 60 

ATAACATTAA ACATAACACC CATGGCACTC AGCAAACCTG TTATAAGAGC TAGCGGCGTC 420 

ATGGATAACT GAGAGAGGTC TCCCTTTGTC GCCATCAAGC AAACACCCAG CATGGCAACC 4 80 

AAAACATAGA AAACAGCGCT TTTTGACCCT CGTTTTTGAT AAACCAAGCG ATTGTAAAAG 540 

AGGATAAAGA CAGGGCTAAT AAACTGTAAA ATAGTTGCTG TCGTAGCATT TGAGTATTCT 6 00 

ACACAGAGAT AGAAAAAATA CTGAACTGAA AAAATCCCCA AAATAGCATA GGCTAAAAAG 650 

GGCAGGTAAT TTTTCTTGTC TCGCCAAATA TCTAGCACTT GCGATTTTAA TTGTATTGCA 720 

GACCAAATGA GTACAAGACT CCCTGCCAGT GTCAAACGCA TAGAGGTAAT CCAGCCCGAA 7 80 

GACACCTGAT AATGAGTAAA GAAGTACTCT CCTAAAATTC CACAGATTCC CCATATTAAG 840 

CCGGATAGGA GCGAATAAAT TTTTCCGTTA ACAATCTTTT TCTGATACTG A 891 
(2) INFORMATION FOR SEQ ID NO: 351: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 325 base pairs 

(B) TYPE : nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 351: 

GAAAGCGTTC AATAGAACAT TGCTTTTTTA TTTTTAGAGT AAGCTAAGCG CTTCAGCATC 60 

TGCGATGATG GTTACATCAG GGTGATTTTG GAGGCTACTT GCAGGTAGGT TCTCAGTCAC 12 0 

TGGGCCAGAT ACTGTTCCGG CAATGGCTTC TGCTTTCGAC TCACCGTAAG CAAAAAGAAT 180 

AATAGACTTG GCATCCAAAA TGTTTTTAAT CCCCATTGAA ATAGCTTGGG TTGGGACGTC 240 

TTCAATCTTG GCAAAGAAGC GTGCATTGGC TTCGATAGTA GACTGGTCAA GTTCTACTAG 3 00 
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ATGCGTTTGA CTGTCAAATG GAGTG 325 
(2) INFORMATION FOR SEQ ID NO: 352: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 344 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY : linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 352: 

CAAGAGCAGT TTGATGATTT TTGATAAGCA TGCGAATTTA AAATACAAAT ATGGCAATCG 60 

CAAGTTTTGG TGTAGAGGCT ATTATGTAGA TACGGTAGGC CGTAATCAGA AAGTGATAGC 120 

TGAATATATT CAGAATCAAT TACAAGAAGA CAGAGTAGCA GACCTAGCTC ACGTTATTCG 180 

AGTCAGTAGA TCCGTTTACT GGCGAAATAA ATAAGAGGAA GTAACGTnAA GTGCTTTAGC 2 40 

ACCTGCTCGG GAAAGTGGTG CGCGAGGAAG CTATTTCAGG ATGCTTTGGC CCTGGCCGGT 300 

AGAAGCGTTA TAGCCGCAGA CTACGACACT TCACACTGGT GGTT 3 44 
(2) INFORMATION FOR SEQ ID NO: 353: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 692 base pairs 

(B) TYPE : nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY : linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 353: 

CCCTATCCCT GCTATTGGGG CTGCTCTCAT TGCTGCTTTG GCACAAATCA GTCTTCCAAT 60 

TGGACCTGTT CCCTTCACTC TGCAAAACTT TGCAATCGGC TTGATTCTAC TGTCTTTAGA 12 0 

CCGAGAGAGG CTGTACTTTC TGCTGGACTC TATCTTCTTC TAGGTGCTAT CGGTCTTCCT 180 

GTCTTTGCAG GAGGTGGAGC TGGTTTTCAG GCTTTAGTTG GCCCTACTGC AGGCTATCTT 2 40 

TGGTTTTATC TCGTTTACTC TGGACTTACT TCCTCTCTAA CCAACAGCAA GAGTGGTGTT 300 

GTTAAGATTT TTCTTGCAAA CCTCTTGGGT GATGCCCTTG TCTTTGTCGG CGGGATTCTC 3 60 

AGCTTGCATT TCCTAGCTGG AATGGCATTT GAAAAAGCTC TTGCTGTGGG GGTTCTTCCC 42 0 

TTTATCATTC CAGACCTTGG CAAACTTCTA GCTATTAGTT TTATTAGCCG TCCCCTACTT 480 

CAACGCCTTA AAAATCAGGC TTACTTTACT AACTAAAAAA GGATATCGAG TTATCATGAC 54 0 

TCAATATCCT TTTCTTTTAT TTTGAAAACT TATACTCAAT GAAAATCAAA GAGCAAACTA 600 

GGAAGCTAGC CGCAGGCTnG CAAAACACTG TTTTGAGGTT GTGGATGAAA CTGACGAGTA 6 60 
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AnATCTCATA CATACGGCAA GGCAAAGCTG AC 
(2) INFORMATION FOR SEQ ID NO: 354: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 1005 base pairs 

(B) TYPE : nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 354: 
GTGATGGACT ACTGGTTCAA AACGCATCCA GAAGATTTTT TCGATAATGT CGGACCTCTT 
GTAGCCAGTA ACTTTTTTCA TACTTACACC GAAGATTTCC ACTTGATGAA GGAAATTGGA 
GTTAATTCTT TCCGCACTTC CATCCAATGG AGTCGACTCA TCAAGAATTT AGAGACAGGT 
GAGCCTGATC CAAAAGGTAT TGCTTTCTAC AATGCCATCA TTGAAGAAGC TAAAAAGAAC 
CAGATGGATC TTGTGATGAA TTTACATCAT TTTGATTTAC CAGTGGAACT TCTTCAAAAA 
TACGGTGGTT GGGAAAGCAA ACATGTAGTG GAGTTATTCG TGAAGTTTGC CAAGACTGCT 
TTCACATGCT TTGGAGATAA GGTTCATTAC TGGACAACTT TCAATGAGCC AATGGTCATT 
CCAGAAGCAG GGTACTTATA TGCTTTCCAT TATCCAAATC TAAAAGGAAA GGGAAAAGAG 
GCCGTACAAG TCATCTATAA TCTAAACCTT GCTAGTGCAA AAGTGATTCA ACTATATCGC 
TCATTAGAAC TTGATGGAAA GATTGGGATT ATTTTAAACT TGACACCTGC TTATCCAAGA 
AGTAATTCTC CAGAAGACTT AGAAGCAAGT CGATTTACAG ATGACTTCTT TAACAAAGTC 
TTCTTGAATC CAGCTGTTAA AGGAACTTTC CCAGAAAGAT TGGTAAAACA GCTAGAGAGA 
GATGGCGTGT TATGGAGTCA TACCGAAAAA GAGCTTCAAC TGATGAAATC AAATACGGTT 
GATTTTCTTG GAGTAAACTA CTACCATCCA AAACGTGTTC AAGCACAAGC AAATCCTGAG 
GAATATCAGA CGCCCTGGAT GCCAGACCAA TACTTCAAAG AGTATGAATG GCTGGAGCGT 
CGCATGAATC CATATCGTGG TTGGGAAATT TTTCCGAAAG CCATTTATGA TATTGCTATG 
ATTGTGAAGG AAGAATATGG TAATATCCCA TGGTTTATCA GTGAA 
(2) INFORMATION FOR SEQ ID NO: 355: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 973 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 355: 

CCGACAAGCA ATATTAAAAA GAGTAAACTA TTAACTAGTT AATTAACCGG TTTATTACTT 6 0 

TATAGTGAAT CAAATATACT TAAGAAAAGA GGAAAGAATG AAAATTAATA AAAAATATCT 12 0 

AGCAGGTTCA GTGGCAGTCC TTGCCCTAAG TGTTTGTTCC TATGAGCTTG GACGTTACCA 180 

AGCTGGTCAG GATAAGAAAG AGTCTAATCG AGTTGCTTAT ATAGATGGTG ATCAGGCTGG 240 

TCAAAAGGCA GAAAACTTGA CACCAGATGA AGTCAGTAAG AGGGAGGGGA TCAACGCCGA 300 

ACAAATTGTT ATCAAGATTA CGGATCAAGG TTATGTGACC TCTCATGGAG ACCATTATCA 3 60 

TTACTATAAT GGCAAGGTTC CTTATGATGC CATCATCAGT GAAGAGCTCC TCATGAAAGA 42 0 

TCCGAATTAT CAGTTGAAGG ATTCAGACAT TGTCAATGAA ATCAAGGGTG GTTATGTCAT 4 80 

TAAGGTAAAC GGTAAATACT ATGTTTACCT CAAGGATGCA GCTCATGCGG ATAATATTCG 54 0 

GACAAAAGAA GAGATTAAAC GTCAGAAGCA GGAACGCAGT CATAATCATA ACTCAAGAGC 600 

AGATAATGCT GTTGCTGCAG CCAGAGCCCA AGGACGTTAT ACAACGGATG ATGGGTATAT 660 

CTTCAATGCA TCTGATATCA TTGAGGACAC GGGTGATGCT TATATCGTTC CTCACGGCGA 72 0 

CCATTACCAT TACATTCCTA AGAATGAGTT ATCAGCTAGC GAGTTAGCTG CTGCAGAAGC 780 

CTATTGGAAT GGGAAGCAGG GATCTCGTCC TTCTTCAAGT TCTAGTTATA ATGCAAATCC 84 0 

AGCTCAACCA AGATTGTCAG AGAACCACAA TCTGACTGTC ACTCCAACTT ATCATCAAAA 900 

TCAAGGGGGA AACATTTCAA GCCTTTTACG TGAATTGTAT GCTAACCCTT ATCAGAACGC 9 60 

CATGTGGGAT CTG 973 
(2) INFORMATION FOR SEQ ID NO: 356: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 843 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 356: 

GGTCGCATCT GCAATATCTG TCGCCTCCAC ATAAGCGACA CCAGCCTTGT CTGCTGCCCG 60 

TTTGACACGT TCTGCAGATT GACCCAGGAT GACCATCTTC TTGAGTCCAG TAATGTCTGG 120 

CACCAATTCG TCAAACTCAT TGCCACGGTC CAAACCACCT GCAATCAAGA CGACCTTGCT 18 0 

GTTGTCAAAT CCTGACAAGC TTTTTGAGTA GCCAAGATAT TAGTTGATTT ACTGTCGTTA 240 

TAGAATTTAA CACsCTTGAT GTCATCCACA AACTGGAGAC GGTGTTTGAC ACCACCGAAG 30 0 

GCTGAAAGAG TTTCCTTGAT GGTTTGATTG TCCACATCAC GAAGCTTGGC TACAGCAATA 360 
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GTCGCAAGGG CATTTTCCAC ATTGTGGCTA CCTGGAACAC CGATTTCATT CGCTGCCATG 42 0 

ACTACTTCAC CACGGAAGTA GAGTTGACCA TCTTCCAGAT AAGCTCCATC AACCTTTTCA 4 80 

AGTGTTGAAA ATGGTACAAC AGTGGCTTCT GTCTTGGAAG TCAAGTCTTT TGCCAAGTCT 540 

TGATTAAAGT TCAAGACAAG GAAATCAGCT GCTGTCATCT TGTTCTGGAT ATTCCACTTG 600 

GCTGCTACAT ATTCCGAAAA TGACCCATGG TAGTCGATAT GAGTTGGCAT GAGGTTGGTA 660 

ATAACCGCAA TCTCTGGATG GAATTCTTGA ACACCCATGA GTTGGAAAGA AGAAAGTTCC 72 0 

ATAACAAGCG TGTCCTTATC TGATGCTATT TGAGCAACCT GACTAGCTGG ATAGCCGATA 780 

TTCCCTGATA AAAGACCATG TTGGCCAGCA GCAGTCAAAA CTTCCCGGGn TCCTCTAGAG 840 

TCG 843 
(2) INFORMATION FOR SEQ ID NO: 357: 

(il SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 807 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY : linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 357: 

TTTTTTTTAT ATTTTTTTTA TTTATTATTT TTTGGCAAAA AAGACCAATT TGCTTTGGAG 60 

CATTGCTTCT GCATTAAATT GTCTATTTTT GCTCGTGCTG TTACGCTCTT TGTATCATGT 120 

ATTAACTAGC AAGTGCAACT TGCAAACTAC TAGTAAGAGG AGAAAAACAA AATGGTTATG 180 

ACTGACCCAA TCGCAGACTT CCTAACTCGT ATTCGTAATG CTAACCAAGC TAAACACGAA 240 

GTACTTGAAG TACCTGCATC AAACATCAAA AAAGGGATTG CTGAAATCCT TAAACGCGAA 300 

GGTTTTGTAA AAAACGTTGA AATCATTGAA GATGACAAAC AAGGCGTCAT CCGTGTATTT 3 60 

CTTAAATACG GACCAAATGG TGAGAAAGTT ATCACTAACT TGAAACGTGT TTCTAAACCA 420 

GGACTTCGTG TCTACAAAAA ACGTGAAGAC CTTCCAAAAG TTCTTAACGG ACTTGGAATT 480 

GCCATCCTTT CAACTTCTGA AGGTTTGCTT ACTGATAAAG AAGCACGCCA AAAGAATGTT 540 

GGTGGTGAGG TTATCGCTTA CGTTTGGTAA AATCAAGATA CAAAGCTCGT AAAGAACAAA 600 

GCAAAATTAG GAAGTTGGAG AAGTTTGTTT ACAAACAGGC CAACTTATCT ATTTTGCACA 660 

GTTCTTAGAG CGTGTTCAGT TCAGCTCTTG AGCTAAGTAA GTATCTGAAC CCCGTGAAAA 720 

CTGGCCGTGC TGGCATGTTC GGGTAACAGG AGAnAATAAA CATGTCACGT ATTGGTAATA 7 80 

AGTTCAGCTA AGGCCTTCGT AAAAGTT 807 
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(2) INFORMATION FOR SEQ ID NO: 358: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 653 base pairs 

(B) TYPE : nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 358: 

CCCAGTATTT TTGTCCAAGC ACGACCAGAA AAGGATGATA CAGATCTGGA ATTGGCTCTC 60 

TTAACCATCT tTGAACAAAA TCCTCAGGCT CAGGTCACTA TTTTCGGTGC CTTGGGTGGC 120 

CGTATTGACC ATATGTTGGC CAATGTCTTT CTGCCTAGCA ATCCTAAGTT GGCACCCTAT 180 

ATGCATCAAA TAGAAATTGA GGATGGGCAA AACTTGATTA CTTATTGTCC AGAAGGAATC 240 

AGTCAGCTAG AACCTCGTTC AGACTACGAC TATCTAGCCT TTATGCCAGT TCGGGATAGC 3 00 

CAAGTATGAG TTGACAGAGG AAAATTTTTT CTTTAAAAAA GTGTACGCTT CTAACGAATA 3 60 

TATAGATAGG GAAGTGTCGG TAACTTGCCC AGATGGTTAT GTGGTCGTAC TGCATAGCAA 420 

GGACAGGAGG TAGGATGGAA AGTTTACTTA TTCTATTATT AATTGCCAAT CTAGCTGGTC 4 80 

TCTTTCTGAT TTGGCAAAGG CAGGATAGGC AGGAGAAACA CTTAAGTAAG AGCTTGGAGG 540 

ATCAGGCAGA TCATTTGTCA GACCAGCTGG ATTACCGCTT TGACCAAGCC AGACAAGCCA 600 

GCCAGTTAGA CCAAAAAGAT TTGGAAGTGG TTGTCAGCGA CCGTTTGCAA GAA 653 
(2) INFORMATION FOR SEQ ID NO: 359: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 641 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 359: 

CACCATGTGA TGTGACGCTG GCCACAGCTG TCAGAAATCT GGCGAGCCAT CGTGTGCAAT 60 

GACTCTTCCC GATGTAATCT TGTTCATAGT CCTTTGATGA ATATGTTCAA GCTGTAGAAG 12 0 

GTGCGCTTCC TGAACACTTA TCAACTGTTA CAGGCGAGTT GACCAGTCAG GAAACAGATG 180 

GCTGGTACAC ACTTGCCAAC ACTTCTTCAT CCCGCATTTA CCTAAAACAA GCCTTCCAAG 240 

AAAATAGCAA CCTCCTAGAG CAAGTGGTAG AACCCTTGAC TATTATCACT GGTGGACACA 3 00 

ACCACAAGGA CCAGTTGACC TATGCTTGGA AAACACTTTT GCAGAATGCG CCACATGATA 3 60 

GTATCTGTGG CTGTAGCGTG GACGAAGTTC ACCGCGAGAT GGAAACGCGT TTTGCCAAGG 42 0 
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TCAACCAAGT AGGAAACTTT GTTAAAAGTA ACTTGCTCAA CGAGTGGAAG GGTAAAATTG 480 

CTACGGATAA GGCTCAAAGT GACTATCTCT TTACTGTCAT TAACACAGGC TTGCATGATA 54 0 

AGGTCGATAC TGTCAGCACA GTGATTGATG TGGCGACTTG TGATTTCAAG GAATTGCACC 600 

CAACAGAAGG CTACAAAAAG ATGGCTGCTC TTATCTTGCC G 641 
(2) INFORMATION FOR SEQ ID NO: 3 60: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1958 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY : linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 360: 

CCTCAAGGCC AATTTGAAGG CTCTAAAACA ATGGAAAAGT GCTACACAGA TGTGACAGAA 60 

TTTGCCATTC CAGCAAGTAC TCAAAAGCTT TACTTATCAC CAGTTTTAGA TGGCTTTAAT 12 0 

AGCGAAATTA TTGCTTTTAA TCTTTCGACT TCACCCAACT TAGAACAAGT ACAAACAATG 180 

TTAGAACAGG CATTCAAAGA GAAGCACTAC GAGAATACGA TTCTCCATAG TGACCAAGGC 24 0 

TGGCAATATC AACACGATTC TTATCATCGG TTCCTAGAGA GTAAGGGAAT TCAAGCATCT 30 0 

ATGTCACGCA AGGGCAACAG CCAAGACAAC GGTATGATGG AATCTTTCTT TGGCATTTTA 3 60 

AAATCCGAAA TGTTTTATGG C T ATGAG AAA ACATTTAAAT CACTTAACCA ATTGGAACAA 42 0 

GCCATTATAG ACTATATTGA TTACTACAAC AACAAACGAA TTAAGGTAAA ACTAAAAGGA 480 

CTTAGTCCTG TGCAGTACAG AACTAAATCC TTTGGATAAA TTAATTGTCT AACTTTTTGG 54 0 

GGTCAGTACA AAACTCTTGC TACTATGCGT TTTATTATTG AAAGACTTAT TGGACTTTCT 600 

CTCAAATCGA GTTTTTACTC AATTTTCTTA CTTGATTGGG ATTGAAATTC CAATTAATTT 660 

CTCTGAGTAG AGTGTCTTGA TATTGGCTTC ATCAACAGAG GCCTTATCAA TTTTACGTTT 72 0 

CAAGAAAAAT TCTTGAATGG TTTCGATTTC AGGCTCACGA ATAGCACGGT GTTTGTTTGA 780 

GATGAGGATT TCATAGTGAA GCGGAGCTTG GGTAAAAATA ACATCTGTAT TCCCTGCAGA 84 0 

ATAAACCTCA ACAAGGGTTG CATCGGTACT TTCTAGCTGA CTTTTTACAA GTTGCGAGTG 900 

TGAGTTTGTC GTATTGATAA GCTTCATAAT ATTTCCTCCG ATTTTCTAAT TCTATTATAG 9 60 

CACTTTTTGA ATAAAGTCGC TTGATTTATA CTCAATGAAA ATCAAAGAGC AAACTAGGAA 102 0 

GCTAGCCGCA GGCTATACTT GAGTACGGTA AGGCGACGCT GACGTGGTTT GAATTTTATT 1080 

TTCGAAGAGT ATTAGCCAAT CTTATGCTGT TTTTTCCAAG ATTCAATGGC CCATTTATGG 1140 
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CTACCACGTT TAAGGTTTTT GATAGCCTCG TCAATAGGGA ACCAGGCAAT ATGATTAAAG 1200 

TTTTCTAGTG GCTTTTGTAC TTCTTTGAAA GGAGTTGCTT CATAGAGGTA GGCAGGATTG 1260 

TAGTAGTAGG TATCACGATG ACGAGAATAG AAATATTCGT CAGCTTGTCC GTAATAGGTA 1320 

CCAATTTCTG CTGTGAAACC AAGCTCTTCA ATCAACTCAT GCTTTAGGGC TTCCTGATGA 1380 

TTTTCACCTG CTTCAATTTC TCCACATGGT AGGAACCAAG CACCATTTGG TTCTTGAACA 1440 

AGAACAATTT GTTTTTGTTC AGGATTAGGG ATAACTGCAT ATACGCCATA GCGAGCAATA 1500 

TAGTCTGTAT TCACTTTTTT TCTCCGAAAG TTGGGTTTGC CATTGCATTT T C C TC AT TAT 1560 

CTAGTATCGT TATTATTATA GTGAAATGAA CCAAAAATAG TACACAATGT GGTATAATCT 1620 

TCTTATGGCA TATTCAATAG ATTTTCGTAA AAAAGTTCTC TCTTATTGTG AGCGAACAGG 1680 

TAGTATAACA GAAGCATCAC ACGTTTTCCA AATCTCACGT AATACCATTT ATGGCTGGTT 1740 

AAAGCTAAAA GAGAAAACAG GAGAGCTAAA CCACCAAGTA AAAGGAATAA AACCAAGAAA 1800 

GGTTGATAGA GATAGACTTA AAAACTATCT TACTGACAAT CCAGACGCTT ATTTGACTGA 18 60 

AATAGCTTCT GAATTTGGCT GTCATCCAAC TACCATCCAC TATGCGCTCA AAGCTATGGG 19 20 

tACACTCGAA AAAAAAAAGA ACTACACCTA CTATGAAC 1958 
(2) INFORMATION FOR SEQ ID NO: 3 61: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH : 851 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY : linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 361: 

TATGAAATTA AGTTATGATG ATAAAGTTCA GATCTATGAA CTTAGAAAAC AAGGATATAG 60 

CTTAGAGAAG CTTTCAAATA AATTTGGGAT AmACAATTCT AATCTTAGGT ATATGATTAA 12 0 

ATTGATTGAT CGTTACGGAA TAGAGTTCGT CAAAAAAGGA AAAAATCGTT ACTATTCTCC 180 

TGATTTAAAA CAAGAAATGA TTAATAAAGT CTGACATGAA GGCTGGACTA AAGATAGAGT 2 40 

TTCTCTTGAA TACGGTCTCC CAAGTCGTAC GATACTTCTT AACTGGCTAG CACAATACAG 3 00 

GAAAAACGGG TATACTATTG TTGAGAAACC AAGAGGGAGA GTACCTGAGA GCGGAGAATG 3 60 

CCATCCTAAA AAAGTTAAGA GAACTCCGAT TGAAGGAGGA AAAAGAGAAA GAAGAAAGAC 420 

AGAAATTGTT TAAGAATTAA TGACTGAGTT TTCGTTAGAT CTTCTTTTAA AAGTCATTAA 4 80 

ACTAGCTCGT TCGACCTACT ACTATCACTT GAAACAGCTA GATAAACCAG ATAAGGACCA 54 0 

AGAGCTTAAA GCTGAAATTC AATCCATTTT TATCGAACAC AAAGGAAATT ATGCTTATCG 6 00 
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TCGGATTTAT TTAGAACTAA GAAATCGTGG TTATCTGGTA AATCATAAAA GAGTTCAAGG 660 

CTTGATGAAA GTACTCAATt TACAAGCTAA AACGCGACAG AAACGAAAAT ATTCTTCTCA 72 0 

TAAAGGAGAC GTTGGCAAGA AGGCAGAGAA TCTCATTCAA GGCCAATTTG AAGGCTCTAA 78 0 

AACAATGGAA CAGTGCTACA CAGATGTGAC AGAATTTGCC ATTCCAGTAA GTACTTAAAA 840 

GCTTTACTTA T 851 
(2) INFORMATION FOR SEQ ID NO: 362: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 1168 base pairs 

(B) TYPE : nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY : linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 362: 

GGGTAGAATC GATATCTCCA ATGAGTTGGT tTAGCTGGTG AAACTGTAAA AAGATTTCGw 60 

CCAATTCAAG GTTGAGGCAT CGCAAACTAT GGACTGTTTC CTCGTCAGTT CTGGAAAGAA 120 

AACGGGATAA GGTTGGCTGT GAAGCAAGCT GCCCTCCTTC CAACAATTTT GGAAAGTAGG 180 

CATCAGCTGA CAATTCTTTA CAAGCATAGT CCGTTCCATA ACCTGTTAAC AGTTGAAAGA 240 

GGAACTGGAC AAGGATATCT GAATCCGAAT AACGACAGTA GCGGCGTTGG TCATTCGTTA 3 00 

CTAAATACTT AGAAATCCGC TCTTTTAGTT TCAACTGGGA AAAAAGTTCC TGAAAAAAGA 3 60 

TAAGACCACC ATACTGGGTT AAATGACCTC CATCGAAAGA TAGTTGGTAA AAAGACTTGT 42 0 

TTTGGAAGTG ATGATTTGGT AAACTGTTCA TGTGAGTTTC CTTTCTTTTT GTGTTTTTTT 480 

CTACACTTAT ACCATAAAGG GGAAACTCTT TTTTGTCTAG TAAAAAACAC CCATTGGGTG 540 

AAAAAAGAAA CCATCCAGGA TCTAAGCTAA GGCAAGGATT CTGGATGGTT TTTAGATTTG 600 

GGGTGAATAA TTGGGGTTTT AGCTGCTTGC GGCCAATCAG GTTCAGATAC AAAAACTTAC 660 

TCATCAACCT TTAGTGGAAA TCCAACTACA TTTAACTATC TATTAGACTA TTACGCTGAT 720 

AATATAGTCA ATTGAAACAA GAACAAGACA AAAGAGCCTC ATAAAAGGTA TTGCAACTTG 780 

GTAATACCTT TTTGAGGTGC TTTTTGATAT GAGCCCATGT TTTCTCAATA GGATTGTACT 840 

CAGGTGAGTA GGGAGGAAGA GGTAAAAGTT TATACCCAAA CTCTTCACAC AAGAGTTCTA 9 00 

ACTTACCCAT TCTATGGAAT CTTGCATTAT CCATAATAAT AACCGATGGT GTGTTTAATG 9 60 

TTGGTAAGAG AAATTTCTGA AACCAAGCTT CAAAAAAGTC GCTCGTCATC GTCTCTTCGT 102 0 

AAGTTATTGG AGCGATTAAC TCACCATTTG TTAGACCTGC AACCAAAGAA ATCCTCTGAT 10 80 
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ATCTTCTTCC AGATACTTTG CCTCTTCTTA ACTGACCTTT TAATGAGCGA CCATATTCTC 
GATAAAAATA AGTATCGAAT CCTGTTTC 
(2) INFORMATION FOR SEQ ID NO: 363: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4483 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 363: 

GTCAGCTTCA GCAAGCCCAT CAGCTTCTGA ATCTGCATCA ACCAGTGCGT CCGCTTCAGC 60 

GTCAACCAGT GCGTCGGCTT CAGCGTCGAC AAGTGCTTCG GCTTCAGCAT CAACGAGTGC 12 0 

GTCGGCCTCA GCAAGCGCAA GTACCTCAGC GTCAGCTTCC GCCTCAACCA GTGCGTCGGC 180 

TTCAGCAAGC ACAAGTGCGT CAGCCTCAGC AAGTATCTCA GCGTCTGAAT CGGCATCAAC 24 0 

GAGTGCGTCT GAGTCAGCAT CAACGAGTAC GTCAGCCTCA GCAAGCACAT CAGCTTCTGA 300 

ATCTGCATCA ACCAGTGCGT CAGCCTCAGC ATCGACAAGC GCCTCAGCTT CAGCAAGTAC 360 

CAGTGCTTCA GCCTCAGCGT CGACAAGTGC GTCGGCCTCA ACCAGTGCAT CTGAATCGGC 420 

ATCAACCAGT GCGTCAGCCT CAGCAAGTAC TAGTGCATCA GCTTCAGCAT CAACGAGTGC 480 

ATCGGCTTCA GCATCAACCA GTGCCTCGGC TTCAGCGTCA ACCAGTGCGT CAGCTTCAGC 540 

AAGTACCAGT GCTTCAGTCT CAGCATCAAC AAGTCCTTCA GCCTCAGCAT CGACAAGTGC 600 

CTCGGCTTCA GCAAGCACAT CAGCATCTGA ATCAGCGTCG ACAAGCGCCT CAGCTTCAGC 660 

AAGTACCAGT GCGTCAGCCT CAGCGTCGAC AAGTGCGTCA GCCTCAGCAA GTACTAGTGC 720 

ATCAGCTTCA GCATCAACGA GTGCATCGGC TTCGGCGTCA ACCAGTGCAT CAGAGTCAGC 780 

AAGTACCAGT GCGTCAGCTT CCGCATCAAC AAGTGCCTCG GCTTCAGCAA GCACCAGTGC 840 

GTCGGCTTCA GCAAGTACTA GCGCCTCAGC CTCAGCCTCA ACCAGTGCGT CAGCCTCAGC 900 

AAGTATCTCA GCGTCTGAAT CGGCATCAAC GAGTGCGTCC GCTTCAGCAA GTACTAGCGC 960 

CTCAGCCTCA GCGTCAACAA GTGCATCGGC TTCAGCGTCA ACGAGTGCGT CTGAATCGGC 1020 

ATCAACGAGT GCGTCCGCTT CAGCAAGTAC TAGCGCCTCA GCCTCAGCGT CAACAAGTGC 1080 

ATCGGCTTCA GCATCAACGA GTGCGTCCGC TTCAGCAAGT ACTAGCGCCT CAGCCTCAGC 1140 

GTCAACAAGT GCATCGGCTT CAGCGTCAAC GAGTGCGTCT GAGTCAGCAT CAACGAGTGC 1200 

GTCAGCCTCA GCAAGCACAT CAGCTTCTGA ATCTGCATCA ACCAGTGCGT CAgCCTCAGC 12 60 

ATCGACAAGC GCCTCAGCTT CAGCAAGTAC CAGTGCGTCA GcTCAGCGTC GACAAGTGCs 1320 
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TCrGCTTCAG CAAGTACCAG TGCGTCAGCC TCAGCAAGTA CCAGTGCkTC AGCCTCAGCG 13 80 

TCGACAAGTG CGTCGGCCTC AACCAGTGCA TCTGAATCGG CATCAACCAG TGCGTCAGCC 1440 

TCAGCAAGTA CTAGCGCCTC AGCCTCAGCA TCAACGAGTG CGTCCGCTTC AGCAAGTACT 1500 

AGTGCATCAG CTTCAGCAAG TACTAGCGCC TCAGCCTCAG CGTCGACAAG CGCCTCAGCT 15 60 

TCAGCAAGTA CCAGTGCGTC AGCCTCAGCG TCGACAAGTG CGTCGGCTTC AGCAAGTACC 162 0 

TCAGCGTCTG AATCAGCATC AACAAGTGCG TCGGCTTCAG CATCAACGAG TGCATCAGCT 16 8 0 

TCAGCATCAA CAAGTGCTTC AGCTTCAGCA AGTACCAGTG CGTCGGCTTC AGCATCAACG 17 40 

AGTGCTTCAG TCTCAGCGTC AACCAGTGCC TCTGAATCCG CATCAACAAG TGCCTCGGCT 1800 

TCAGCAAGCA CCAGTGCTTC GGCTTCAGCG TCAACGAGTG CGTCTGAGTC AGCATCAACG 18 60 

AGTGCGTCAC CTCAGCAAGC ACATCAGCTT CTGAATCTGC ATCAACCAGT GCGTCACTTC 192 0 

CGCATCAACA AGCGCCTCGG CCTCAGCAAG TACAAGTGCT TCAGCCTCAG CATCAACCAG 19 8 0 

TGCATCAGCT TCAGCCTCAA CAAGTGCTTC AGCCTCAGCG TCAACCAGTG CCTCGGCTTC 2040 

AGCAAGTACC AGTGCGTCAG CTTCAGCAAG CACAAGTGCG TCAGCTTCAG CATCAACCAG 2100 

TGCTTCGGCT TCGGCATCAA CAAGTGCCTC AGCATCAGCA TCAACGAGTG CGTCAsCTCA 2160 

GCAAGTACTA GTGCATCAGC ATCAGCATCA ACCAGTGCAT CAGCCTCAGC AAGTATCTCA 2220 

GCGTCTGAAT CGGCATCAAC GAGTGCATCA GCATCAGCAT CAACGAGTGC ATCGGCTTCA 228 0 

GCGTCAACCA GTGCATCAGT CTCAGCAAGC ACCAGTGCGT CGGCTTCAGC ATCAACCAGT 2 34 0 

GCCTCAGCCT CAGCAAGTAT CTCAGCGTCT GAATCGGCAT CAACGAGTGC GTCAGcCTCA 2400 

GCAAGTACTA GTGCATCAGC ATCAGCATCA ACGAGTGCAT CGGCTTCAGC AAGTACCAGC 24 6 0 

GCCTCAGCTT CAGCAAGCAC CAGTGCGTCA GCCTCAGCAA GTACCAGCGC CTCAGCCTCA 252 0 

GCAAGCACCA GTGCCTCAGC TTCAGCAAGT ACCAGTGCGT CAGCCTCAGC GTCGACAAGT 2580 

GCGTCGGCTT CAGCAAGTAC CTCAGCGTCT GAATCAGCAT CAACGAGTGC ATCAGCTTCA 2 640 

GCATCAACAA GTGCTTCAGC TTCAGCAAGT ACCAGTGCGT CGGCTTCAGC ATCAACGAGT 2700 

GCTTCAGTCT CAGCGTCAAC CAGTGCCTCT GAATCAGCAT CAACAAGTGC CTCGGCTTCA 27 60 

GCAAGCACCA GTGCGTCGGC TTCAGCAAGT ACTAGTGCAT CGGCTTCAGC ATCGACAAGT 2 82 0 

GCGTCTGAAT CGGCATCAAC GAGTGCTTCG GCTTCAGCAT CAACGAGTGC GTCAGCCTCA 2 880 

GCAAGCACAT CAGCTTCTGA ATCTGCATCA ACCAGTGCGT CCGCTTCAGC GTCAACCAGT 294 0 

GCGTCGGCTT CAGCGTCGAC AAGTGCTTCG GCTTCAGCAT CAACGAGTGC GTCGGCCTCA 3000 

GCAAGCGCAA GTACCTCAGC GTCAGCTTCC GCCTCAACCA GTGCGTCCGC TTCAGCAAGC 3060 
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ACAAGTGCGT CAGCCTCAGC AAGTATCTCA GCGTCTGAAT CGGCATCAAC GAGTGCGTCG 312 0 

GCCTCAGCAA GCGCAAGTAC CTCAGCGTCA GCTTCCGCCT CAACCAGTGC GTCGGCTTCA 3180 

GCAAGCACAA GTGCGTCAGC CTCAGCAAGT ATCTCAGCGT CTGAATCGGC ATCAACGAGT 3240 

GCGTCTGAGT CAGCATCAAC GAGTACGTCA GCCTCAGCAA GCACATCAGC TTCTGAATCG 33 00 

GCATCAACCA GTGCGTCAGC CTCAGCATCG ACAAGCGCCT CAGCTTCAGC AAGTACCAGT 3360 

GCTTCAGCCT CAGCGTCGAC AAGTGCGTCG GCCTCAACCA GTGCATCTGA ATCGGCATCA 3 42 0 

ACCAGTGCGT CAGCCTCAGC AAGTACTAGT GCATCAGCTT CAGCATCAAC GAGTGCATCG 348 0 

GCTTCAGCAT CAACCAGTGC CTCGGCTTCA GCGTCAACCA GTGCGTCAGC TTCAGCAAGT 3 540 

ACCAGTGCTT CAGTCTCAGC ATCAACAAGT GCTTCAGCCT CAGCATCGAC AAGTGCCTCG 3 600 

GCTTCAGCAA GCACATCAGC ATCTGAATCA GCGTCGACAA GCGCCTCAGC TTCAGCAAGT 3 660 

ACCAGTGCGT CAGCCTCAGC GTCGACAAGT GCGTCAGCCT CAGCAAGTAC TAGTGCATCA 3 720 

GCTTCAGCAT CAACGAGTGC ATCGGCTTCG GCGTCAACCA GTGCATCAGA GTCAGCAAGT 3780 

ACCAGTGCGT CAGCTTCCGC ATCAACAAGT GCCTCGGCTT CAGCAAGCAC CAGTGCGTCG 3 840 

GCTTCAGCAA GTACTAGCGC CTCAGCCTCA GCCTCAACCA GTGCGTCAGC CTCAGCAAGT 3 900 

ATCTCAGCGT CTGAATCGGC ATCAACGAGT GCGTCCGCTT CAGCAAGTAC TAGCGCCTCA 3960 

GCCTCAGCGT CAACAAGTGC ATCGGCTTCA GCGTCAACGA GTGCGTCTGA ATCGGCATCA 4020 

ACGAGTGCGT CCGCTTCAGC AAGTACTAGC GCCTCAGCCT CAGCGTCAAC AAGTGCATCG 4 080 

GCTTCAGCAT CAACGAGTGC GTCCGCTTCA GCAAGTACTA GCGCCTCAGC CTCAGCGTCA 4140 

ACAAGTGCAT CGGGTTCAGC GTCAACGAGT GCGTCTGAGT CAGCATCAAC GAGTGCGTCA 42 00 

CCTCAkCAAG CACATCAGCT TCTGAATCTG CATCAACCAG TGCGTCACTT CCGCATCAAC 4 2 60 

AAGCGCCTCG GCCTCAGCAA GTACAAGTGC TTCAGCCTCA GCATCAACCA GTGCATCAGC 43 2 0 

TTCAGCCTCA ACAAGTGCTT CAGCCTCAGC GTCAGACCAG TGCCTCGGCT TCAGCAAGTA 4 3 80 

CCAGTGCGTC ACTTCAGCAA GCACAAGTGC GTCAGCTTCA GCATCAACCA GTGCTTCGliC 4440 

TTCGGCATCA ACAAGTGCCT CAGCATCAGC ATCAACGAGT GCG 4 4 83 
(2) INFORMATION FOR SEQ ID NO: 3 64: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2550 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 364: 
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GTACCTCAGC GTCCTTCCGC 
CCTCAGCAAG TATCTCAGCG 
CAAGTACCTC AGCGTCACTT 
GTCASCTCAG CAAGTATCTC 
TCAACGAGTA CGTCAGCCTC 
TCAGCCTCAG CATCGACAAG 
TCGACAAGTG CGTCGGCCTC 
TCAGCAAGTA CTAGTGCATC 
AGTGCCTCGG CTTCAGCGTC 
TCAGCATCAA CAAGTGCTTC 
TCAGCATCTG AATCAGCGTC 
TCAACCAGTG CGTCAGCCTC 
TCGGCTTCGG CGTCAACCAG 
TCAACAAGTG CCTCGGCTTC 
TCGGCTTCAG CAAGTACCAG 
AGCACCTCAG CTTCTGAATC 
TCTGAATCGG CCTCAACCAG 
AGCACAAGCG CCTCGGGTTC 
TCAGCCTCAG CATCAACAAG 
TCAACGAGTG CGTCTGAGTC 
TCAGCCTCAG CAAGTATCTC 
AGTACTAGCG CCTCAGCATC 
TCTGAGTCAG CATCAACGAG 
TCAACCAGTG CGTCAGCCTC 
TCAGCCTCAG CAAGTACCAG 
AGTGCATCTG AATCGGCATC 
CAGCATCAAC GAGTGCATCG 
GTGCGTCACt TCCGCATCAA 
AGCGTCAACC AGTGCTTCGG 
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CTCAACCAGT GCGTCCGCTT CAGCAAGCAC AAGTGCGTCA 60 

TCTGAATCGG CATCAACGAG TGCGTCGGCC TCAGCAAGCG 12 0 

CCGCCTCAAC CAGTGCGTCG GCTTCAGCAA GCACAAGTGC 180 

AGCGTCTGAA TCGGCATCAA CGAGTGCGTC TGAGTCAGCA 24 0 

AGCAAGCACA TCAGCTTCTG AATCGGCATC AACCAGTGCG 300 

CGCCTCAGCT TCAGCAAGTA CCAGTGCTTC AGCCTCAGCG 3 60 

AACCAGTGCA TCTGAATCGG CATCAACCAG TGCGTCAGCC 42 0 

AGCTTCAGCA TCAACGAGTG CATCGGCTTC AGCATCAACC 48 0 

AACCAGTGCG TCAGCTTCAG CAAGTACCAG TGCTTCAGTC 540 

AGCCTCAGCA TCGACAAGTG CCTCGGCTTC AGCAAGCACA 600 

GACAAGTGCG TCGGCCTCAA CCAGTGCATC TGAATCGGCA 660 

AGCAAGTACT AGTGCATCAC CTTCAGCATC AACGAGTGCA 72 0 

TGCATCAGAG TCAGCAAGTA CCAGTGCGTC AGCTTCCGCA 780 

AGCAAGCACA TCAGCATCTG AATCAGCGTC AACCAGTGCT 840 

TGCTTCAGCT TCAGCATCAA CCAGCGCCTC GGCCTCAGCA 900 

GGCCTCAACC AGCGCCTCGG CCTCAGCAAG CACCTCAGCT 960 

CGCCTCAGCC TCAGCATCAA CGAGTGCTTC GGCTTCAGCA 1020 

AGCATCAACG AGTACGTCAG CTTCAGCGTC AACCAGTGCT 1080 

TGCGTCAGCC TCAGCAAGTA TCTCAGCGTC TGAATCGGCA 1140 

AGCATCAACG AGTACGTCAG CCTCAGCAAG CACAAGTGCT 1200 

AGCGTCTGAA TCGGCATCAA CGAGTGCGTC CGCTTCAGCA 12 60 

AGCGTCAACA AGTGCTTCGG CTTCAGCCTC AACGAGTGCG 1320 

TACGTCAGCC TCAGCAAGCA CATCAGCTTC TGAATCTGCA 13 80 

AGCATCGACA AGCGCCTCAG CTTCAGCAAG TACCAGTGCG 1440 

TGCTTCAGCC TCAGCGTCGA CAAGTGCGTC GGCCTCAACC 1500 

AACCAGTGCG TCAGCTCAGC AAGTACTAGT GCATCAGCTT 1560 

GCTTCGGCGT CAACCAGTGC ATCAGAGTCA GCAAGTACCA 1620 

CAAGTGCCTC GGCTTCAGCA AGCACATCAG CATCTGAATC 1680 

CTTCAGCAAG TACCAGTGCT TCAGCTTCAG CATCAACCAG 1740 
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AGCAAGCACC TCAGCTTCTG AATCGGCCTC AACCAGCGCC TCAGCCTCAG CATCAACGAG 1860 

TGCTTCGGCT TCAGCAAGCA CAAGCGCCTC GGGTTCAGCA TCAACGAGTA CGTCAGCTTC 192 0 

AGCGTCAACC AGTGCTTCAG CCTCAGCATC AACAAGTGCG TCAGCCTCAG CAAGTATCTC 1980 

AGCGTCTGAA TCGGCATCAA CGAGTGCGTC TGAGTCAGCA TCAACGAGTA CGTCAGCCTC 2 040 

AGCAAGCACC TCAGCTTCTG AATCGGCCTC AACCAGTGCG TCAGCCTCAG CATCGACAAG 2100 

CGCCTCAGCT TCAGCAAGTA CCAGTGCTTC AGCCTCAGCG TCGACAAGTG CGTCGGCCTC 2160 

AACCAGTGCA TCTGAATCGG CATCAACCAG TGCGTCAGCC TCAGCAAGTA CTAGTGCATC 222 D 

GGCTTCAGCA TCAACCAGTG CCTCGGCTTC AGCGTCAACC AGTGCGTCAG CTTCAGCAAG 2280 

TACCAGTGCT TCAGTCTCAG CATCAACAAG TGCTTCAGCC TCAGCATCGA CAAGTGCCTC 234 0 

GGCTTCAGCA AGCACATCAG CATCTGAATC AGCGTCGACA AGCGCCTCAG CTTCAGCAAG 2400 

TACCAGTGCG TCAGCCTCAG CGTCGACAAG TGCGTCAGCT ACAGCAAGTA CTAGTGCATC 24 6 0 

AGCTTCAGCA TCAACGAGTG CATCGGCTTC GGCGTCAACC AGTGCATCAG AGTCAGCAAG 252 0 

TACCAGTGCG TCAGTTCACG CATCAACAAG 2550 
(2) INFORMATION FOR SEQ ID NO: 365: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1436 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 3 65: 

ACCCAGCAAG TACTAGTGCA TCGGCTTCAG CAAGCACCAG TGCGTCGGCT TCAGCATCAA 60 

CCAGTGCCTC AGCCTCAGCA AGTATCTCAG CGTCTGAATC GGCATCAACG AGTGCGTCAC 12 0 

CTCAGCAAGT ACTAGTGCAT CAGCATCAGC ATCAACGAGT GCATCGGCTT CAGCAAGTAC 180 

CAGCGCCTCA GCTTCAGCAA GCACCAGTGC GTCAsCTCAG CAAGTACCAG CGCCTCAGCC 24 0 

TCAGCAAGCA CCAGTGCCTC AGCTTCAGCA AGTACCAGTG CGTCAGCCTC AGCGTCGACA 300 

AGTGCGTCGG CTTCAGCAAG TACCTCAGCG TCTGAATCAG CATCAACGAG TGCATCAGCT 360 

TCAGCATCAA CAAGTGCTTC AGCTTCAGCA AGTATCTCAG CGTCTGAATC GGCATCAACG 420 

AGTGCGTCCG CTTCAGCAAG TACTAGCGCC TCAGCATCAG CGTCAACAAG TGCTTCGGCT 480 

TCAGCGTCAA CGAGTGCGTC TGAGTCAGCA TCAACGAGTA CGTCAGCCTC AGCAAGCACA 540 

TCAGCTTCTG AATCTGCATC AACCAGTGCG TCAGCCTCAG CATCGACAAG CGCCTCAGCT 600 



WO 98/18931 



PCTYUS97/19588 



1381 

TCAGCAAGTA CCAGTGCGTC AgCCTCAGCA AGTACCAGTG CTTCAGCCTC AGCGTCGACA 6 60 

AGTGCGTCGG CCTCAACCAG TGCATCTGAA TCGGCATCAA CCAGTGCGTC AGCCTCAGCA 720 

AGTACTAGCG CCTCAGCCTC AGCATCAACG AGTGCGTCCG CTTCAGCAAG TACTAGTGCA 780 

TCAGCTTCAG CAAGTACTAG CGCCTCAGCC TCAGCGTCGA CAAGCGCCTC AGCTTCAGCA 840 

AGTACCAGTG CGTCAGCCTC AGCGTCGACA AGTGCGTCGG CTTCAGCAAG TACCTCAGCG 900 

TCTGAATCAG CATCAACAAG TGCGTCGGCT TCAGCATCAA CGAGTGCATC AGCTTCAGCA 9 60 

TCAACAAGTG CTTCAGCTTC AGCAAGTACC AGTGCGTCGG CTTCAGCATC AACGAGTGCT 1020 

TCAGTCTCAG CGTCAACCAG TGCCTCTGAA TCCGCATCAA CAAGTGCCTC GGCTTCAGCA 1080 

AGCACCAGTG CTTCGGCTTC AGCGTCAACG AGTGCGTCTG AGTCAGCATC AACGAGTGCG 1140 

TCAGCCTCAG CAAGCACATC AGCTTCTGAA TCTGCATCAA CCAGTGCGTC AGCTTCCGCA 12 00 

TCAACAAGCG CCTCGGCCTC AGCAAGTACA AGTGCTTCAG CCTCAGCATC AACCAGTGCA 12 60 

TCAGCTTCAG CCTCAACAAG TGCTTCAGCC TCAGCGTCAA CCAGTGCCTC GGCTTCAGCA 13 20 

AGTACCAGTG CGTCAGCTTC AGCAAGCACA AGTGCGTCAG CTTCAGCATC AACCAGTGCT 13 80 

TCGGCTTCGG CATCAACAAG TGCCTCAGCA TCAGCATCAA CGAGTGCGTC AGCCGG 14 3 6 
(2) INFORMATION FOR SEQ ID NO: 366: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 735 base pairs 

(B) TYPE : nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 366: 

GCAGTTGCCA CACCGTGCTG ACCAGCACCC GTTCCTGCGA TAATTTTCTT TTTACCCATG 60 

CGTwTGGCAA GCCAAACTTG TCCTAAGGCA TTGTTAATCT TGTGGGCTCC TGTATGGTTA 12 0 

AGGTCTTCCC GTTTGAGATA AATCTTGCTC CGCCAATATG CTGGGTCAAG TTTTTTGCGT 180 

AATAAAGAGG AGTTTCACGT CCTACGTACT GGCGCAAAAG CTGGTTTAAT TCCTCTTGGA 240 

AACTTGGGTC TGCCTGACTT TCACGGTAGG CCTTCTCCAA CTCCAAAACT GCTGTCATCA 3 00 

ATGTTTCTGG GACAAAACGT CCGCCGAATT TTCCGTAAAA TCCATCTTTA TTTGGTTCCT 3 60 

GATATGCCAT GCTTTACCCT CTCTATAAAT CTTCTAATCT TTTCATGATC TTTTTGTCCA 42 0 

TCTGTCTCCA CTCCGCTCGA TACATCTACT GCATAGGGAG TAAAGTGTTG AATTGCTTTT 480 

ACTACATTAT CTTCATTAAG GCCACCTGCG ATAAAGAAGG GCTGTGCTAG TCCAGTCGTA 540 
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TCCAGTTGAC CCCAATCAAA GGGCTGGCCA CTTCCTGCCA CAGGGGCATC AAAGAGTAGA 600 

TAATCTGCCT GAGAATTGGG GACATGCCCA TTTCCATCTA CCTGCACAGC CTGAATACTG 6 60 

GCACAAGGCA AATTCTCAAA TAAATCATCT GCCACCTGAC CGTGAACTTG AACCAAGTCC 72 0 

AAGCCGGGGA TCCTC 735 
(2) INFORMATION FOR SEQ ID NO: 367: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 1702 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY : linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 3 67: 

TACTAGCGCC TCAGCCTCAG CGTCAACAAG TGCATCGGCT TCAGCATCAA CGAGTGCGTC 6 0 

CGCTTCAGCA AGTACTAGCG CCTCAGCCTC AGCGTCAACA AGTGCATCGG CTTCAGCGTC 12 0 

AACGAGTGCG TCTGAGTCAG CATCAACGAG TGCGTCAGCC TCAGCAAGCA CATCAGCTTC 180 

TGAATCTGCA TCAACCAGTG CGTCAGCCTC AGCATCGACA AGCGCCTCAG CTTCAGCAAG 24 0 

TACCAGTGCG TCAGCCTCAG CGTCGACAAG TGCGTCGGCT TCAGCAAGTA CCAGTGCGTC 30 0 

AGCCTCAGCA AGTACCAGTG CGTCAGCCTC AGCGTCGACA AGTGCGTCGG CCTCAACCAG 3 60 

TGCATCTGAA TCGGCATCAA CCAGTGCGTC AGCCTCAGCA AGTACTAGTG CATCAGCTTC 42 0 

AGCATCAACG AGTGCATCGG CTTCAGCATC AACCAGTGCA TCAGAGTCAG CAAGTACCAG 480 

TGCGTCAgCT TCCGCATCAA CAAGTGCCTC GGCTTCAGCA AGTACTAGCG CCTCAGCCTC 540 

AGCGTCAACA AGTGCTTCAG CTTCCGCGTC AACCAGCGCC TCGGCCTCAG CAAGTATCTC 600 

AGCGTCTGAA TCGGCATCAA CAAGTGCCTC GGCTTCAGCA TCAACGAGTG CATCAGTCTC 660 

AGCAAGCACC AGTGCGTCGG CCTCAGCAAG CACCAGCGCG TCTGAATCCG CATCAACCAG 72 0 

TGCCTCAGCT TCAGCAAGTA CCTCAGCATC TGAATCAGCA TCAACAAGTG CATCGGCT .'C 78 0 

AGCAAGCACA AGTGCTTCAG CCTCAGCAAG TATCTCAGCG TCTGAATCGG CATCAACGAG 84 0 

TGCGTCCGCT TCAGCAAGTA CTAGCGCCTC AGCATCAGCG TCAACAAGTG CTTCGGCTTC 900 

AGCGTCAACG AGTGCGTCTG AGTCAGCATC AACGAGTACG TCAGCCTCAG CAAGCACATC 960 

AGCTTCTGAA TCTGCATCAA CCAGTGCGTC AGCCTCAGCA TCGACAAGCG CCTCAGCTTC 102 0 

AGCAAGTACC AGTGCGTCAG CCTCAGCAAG TACCAGTGCT TCAGCCTCAG CGTCGACAAG 108 0 

TGCGTCGGCC TCAACCAGTG CATCTGAATC GGCATCAACC AGTGCGTCAG CCTCAGCAAG 114 0 

TACTAGCGCC TCAGCCTCAG CATCAACGAG TGCGTCCGCT TCAGCAAGTA CTAGTGCATC 1200 
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AGCATCAGCA TCAACGAGTG CATCGGCTTC AGCAAGTACC AGCGCCTCAG CTTCAGCAAG 12 60 

CACCAGTGCG TCAGCCTCAG CAAGTACCAG CGCCTCAGCC TCAGCAAGCA CCAGTGCCTC 1320 

AGCTTCAGCA AGTACCAGTG CGTCAGCCTC AGCGTCGACA AGTGCGTCGG CTTCAGCAAG 13 80 

TACCTCAGCG TCTGAATCAG CATCAACGAG TGCATCAGCT TCAGCATCAA CAAGTGCTTC 144 0 

AGCTTCAGCA AGTACCAGTG CGTCGGCTTC AGCATCAACG AGTGCTTCAG TCTCAGCGTC 1500 

AACCAGTGCC TCTGAATCAG CATCAACAAG TGCCTCGGCT TCAGCAAGCA CCAGTGCGTC 1560 

GGCTTCAGCA AGTACTAGTG CATCGGCTTC AGCATCGACA AGTGCGTCTG AATCGGCATC 162 0 

AACGAGTGCT TCGGCTTCAG CATCAACGAG TGCGTCAGCC TCAGCAAGCA CATCAGCTTC 1680 

TGAATCTGCA TCAACCAGTG CG 17 02 
(2) INFORMATION FOR SEQ ID NO: 368: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 941 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 368: 
ACCAGTGCAT CAGCTTCAGC CTCAACAAGT GCTTCAGCCT CAGCGTCAAC CAGTGCCTCG 
GCTTCAGCAA GTACCAGTGC GTCACTTCAG CAAGCACAAG TGCGTCACTT CAGCATCAAC 
CAGTGCTTCG GCTTCGGCAT CAACAAGTGC CTCAGCATCA GCATCAACGA GTGCGTCACC 
TCAGCAAGTA CTAGTGCATC AGCATCAGCA TCAACCAGTG CATCAGCCTC AGCAAGTATC 
TCAGCGTCTG AATCGGCATC AACGAGTGCA TCAGCATCAG CATCAACGAG TGCATCGGCT 
TCAGCGTCAA CCAGTGCATC AGTCTCAGCA AGCACCAGTG CGTCGGCTTC AGCATCAACG 
AGTGCCTCAG CCTCAGCAAG TATCTCAGCG TCTGAATCGG CATCAACGAG TGCGTCAGCC 
TCAGCAAGTA CTAGTGCATC GGCTTCAGCA AGCACCAGTG CGTCGGCTTC AGCATCAACC 
AGTGCCTCAG CCTCAGCAAG TATCTCAGCG TCTGAATCGG CATCAACGAG TGCGTCAGCC 
TCAGCAAGTA CTAGTGCATC AGCATCAGCA TCAACGAGTG CATCGGCTTC AGCAAGTACC 
AGCGCCTCAG CTTCAGCAAG CACCAGTGCG TCAGCCTCAG CAAGTACCAG CGCCTCAGCC 
TCAGCAAGCA CCAGTGCCTC AGCTTCAGCA AGTACCAGTG CGTCAGCCTC AGCGTCGACA 
AGTGCGTCGG CTTCAGCAAG TACCTCAGCG TCTGAATCAG CATCAACGAG TGCATCAGCT 
TCAGCATCAA CAAGTGCTTC AGCTTCAGCA AGTACCAGTG CGTCGGCTTC AGCATCAACG 
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AGTGCTTCAG TCTCAGCGTC AACCAGTGCC TCTGAATCAG CATCAACAAG TGCCTCGGCT 900 
TCAGCAAGCA CCAGTGCGTC GGCTTCAGCA AGTACTAGTG C 941 
(2) INFORMATION FOR SEQ ID NO: 369: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 869 base pairs 

(B) TYPE : nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY : linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 3 69: 

CAGCAAGTAC TAGTGCATCA GCTTCAGCAT CAACGAGTGC ATCGGCTTCT GCGTCAACCA 60 

GTGCATCAGA GTCAGCAAGT ACCAGTGCGT CAGCTTCCGC ATCAACAAGT GCCTCGGCTT 12 0 

CAGCAAGCAC CAGTGCGTCG GCTTCAGCAA GTACTAGCGC CTCAGCCTCA GCCTCAACCA 180 

GTGCGTCAGC CTCAGCAAGT ATCTCAGCGT CTGAATCGGC ATCAACGAGT GCGTCCGCTT 240 

CAGCAAGTAC TAGCGCCTCA GCCTCAGCGT CAACAAGTGC ATCGGCTTCA GCGTCAACGA 300 

GTGCGTCTGA ATCGGCATCA ACGAGTGCGT CCGCTTCAGC AAGTACTAGC GCCTCAGCCT 360 

CAGCGTCAAC AAGTGCATCG GCTTCAGCAT CAACGAGTGC GTCCGCTTCA CCAAGTACTA 42 0 

GCGCCTCAGC CTCAGCGTCA ACAAGTGCAT CGGCTTCAGC GTCAACGAGT GCGTCTGAGT 480 

CAGCATCAAC GAGTGCGTCA GCCTCAGCAA GCACATCAGC TTCTGAATCT GCATCAACCA 540 

GTGCGTCAGC CTCAGCATCG ACAAGCGCCT CAGCTTCAGC AAGTACCAGT GCGTCAGCCT 600 

CAGCGTCGAC AAGTGCGTCG GCTTCAGCAA GTACCAGTGC GTCAGCCTCA GCAAGTACCA 660 

GTGCGTCAGC CTCAGCGTCG ACAAGTGCGT CGGCCTCAAC CAGTGCATCT GAATCGGCAT 72 0 

CAACCAGTGC GTCAGCCTCA GCAAGTACTA GTGCATCAGC TTCAGCATCA ACGAGTGCAT 780 

CGGCTTCAGC ATCAACCAGT GCATCAGAGT CAGCAAGTAC CAGTGCGTCA GnTTCCGCAT 840 

GCAACAAGTG CCTCGGCTTC AGCAAGTAC 869 
(2) INFORMATION FOR SEQ ID NO : 370: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 750 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 370: 
TCAACAAGTG CCTCAGCATC AGCATCAACG AGTGCGTCAG CCTCAGCAAG TACTAGTGCA 60 
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TCAGCATCAG CATCAACCAG TGCATCAGCC TCAGCAAGTA TCTCAGCGTC TGAATCGGCA 12 0 

TCAACGAGTG CATCAGCATC AGCATCAACG AGTGCATCGG CTTCAGCGTC AACCAGTGCA 180 

TCAGTCTCAG CAAGCACCAG TGCGTCGGCT TCAGCATCAA CGAGTGCCTC AGCCTCAGCA 24 0 

AGTATCTCAG CGTCTGAATC GGCATCAACG AGTGCGTCAG CCTCAGCAAG TACTAGTGCA 300 

TCGGCTTCAG CAAGCACCAG TGCGTCGGCT TCAGCATCAA CCAGTGCCTC AGCCTCAGCA 360 

AGTATCTCAG CGTCTGAATC GGCATCAACG AGTGCGTCAG CCTCAGCAAG TACTAGTGCA 420 

TCAGCATCAG CATCAACGAG TGCATCGGCT TCAGCAAGTA CCAGCGCCTC AGCTTCAGCA 480 

AGCACCAGTG CGTCAGCCTC AGCAAGTACC AGCGCCTCAG CCTCAGCAAG CACCAGTGCC 540 

TCAGCTTCAG CAAGTACCAG TGCGTCAGCC TCAGCGTCGA CAAGTGCGTC GGCTTCAGCA 600 

AGTACCTCAG CGTCTGAATC AGCATCAACG AGTGCATCAG CTTCAGCATC AACAAGTGCT 660 

TCAGCTTCAG CAAGTATCTC AGCGTCTGAA TCGGCATCAA CGAGTGCGTC GGCTTCAGCA 720 

AGTACTAGCG CCTCAGCATC AGCGTCAACG 750 
(2) INFORMATION FOR SEQ ID NO: 3 71: 

(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 957 base pairs 

(B) TYPE : nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 371: 

CCGGAAAACA GCTCTGGCGC TTGGTCTTGC CCAGCGTATT GCTAGTGGTG ACGTGCCTGC 60 

GGAAATGGCT AAGATGCGCG TGTTAGAACT TGATTTGATG AATGTCGTTG CAGGGACACG 120 

CTTCCGTGGT GACTTTGAAG AACGCATGAA TAATATCATC AAGGATATTG AAGAAGATGG 180 

CCAAGTCATC CTCTTTATCG ATGAACTCCA CACCATCATG GGTTCTGGTA GCGGGATTGA 2 40 

TTCGACTCTG GATGCGGCCA ATATCTTGAA ACCAGCCTTG GCGCGTGGAA CTTTGAGAAC 3 00 

GGTTGGTGCC ACTACTCAGG AAGAATATCA AAAACATATC GAAAAAGATG CGGCACTTTC 3 60 

TCGTCGTTTC GCTAAAGTGA CGATTGAAGA ACCAAGTGTG GCAGATAGTA TGACTATTTT 4 20 

ACAAGGTTTG AAGGCGACTT ATGAGAAACA TCACCGTGTA CAAATCACAG ATGAAGCGGT 480 

TGAAACAGCG GTTAAGATGG CTCATCGTTA TTTAACCAGT CGTCACTTGC CAGACTCTGC 540 

TATCGATCTC TTGGATGAGG CGGCAGCAAC AGTGCAAAAT AAGGCAAAGC ATGTAAAAGC 600 

AGACGATTCA GATTTGAGTC CAGCTGACAA GGCCCTGATG GATGGCAAGT GGAAACAGGC 660 
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AGCCCAGCTA ATCGCAAAAG AAGAGGAAGT ACCTGTCTAC AAAGACTTGG TGACAGAGTC 720 

TGATATTTTG ACCACCTTGA GTCGCTTGTC AGGAATCCCA GTTCAAAAAC TGACTCAAAC 780 

GGATGCTAAG AAGTATTTAA ATCTTGAAGC AGAACTCCAT AAACGGGTTA TCGGTCAAGA 840 

TCAAGCTGTT TCAAGCATTA GCCGTGCCAT TCGCCGCAAC CAGTCAGGGA TTCGCAGTCA 900 

TAAGCGTCCG ATTGGTTCCT TTATGTTCCT AGGGCCTACA GGTGTCGGGG TATCCGA 957 
(2) INFORMATION FOR SEQ ID NO: 372: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 807 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION : SEQ ID NO: 372: 
CAAAGCGCCT CAGCTTCAGC ATCAACAAGT GCGTCGGCTT CAGCATCAAC CAGTGCCTCG 60 

GCTTCAGCGT CAACCAGTGC GTCACATTCA GCAAGTACCA GTGCTTCAGT CTCAGCATCA 120 

ACAAGTGCTT CAGCCTCAGC ATCGACAAGT GCCTCGGCTT CAGCAAGCAC ATCAGCATCT 180 

GAATCAGCGT CAACCAGTGC TTCGGCTTCA GCAAGTACCA GTGCTTCAGC TTCAGCATCA 240 

ACCAGCGCCT CGGCCTCAGC AAGCACCTCA GCTTCTGAAT CGGCCTCAAC CAGCGCCTCG 300 

GCCTCAGCAA GCACCTCAGC TTCTGAATCG GCCTCAACCA GCGCCTCAGC CTCAGCATCA 360 

ACGAGTGCTT CGGCTTCAGC AAGCACAAGC GCCTCGGGTT CAGCATCAAC GAGTACGTCA 420 

GCTTCAGCGT CAACCAGTGC TTCAGCCTCA GCATCAACAA GTGCGTCAGC CTCAGCAAGT 480 

ATCTCAGCGT CTGAATCGGC ATCAACGAGT GCGTCTGAGT CAGCATCAAC GAGTACGTCA 540 

GCCTCAGCAA GCACCTCAGC TTCTGAATCG GCCTCAACCA GTGCGTCAGC CTCAGCATCG 600 

ACAAGCGCCT CAGCTTCAGC AAGTACCAGT GCTTCAGCCT CAGCGTCGAC AAGTGCGTCG 660 

GCCTCAACCA GTGCATCTGA ATCGGCATCA ACCAGTGCGT CAGCCTCAGC AAGTACTA3T 7 20 

GCATCGGCTT CAGCATCAAC CAGTGCCTCG GCTTCAGCGT CAACCAGTGC GTCAGCTTCA 7 80 

GCAAGTACCA TGTGCTTCAT GTCTCAG 8 07 
(2) INFORMATION FOR SEQ ID NO : 37 3: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1068 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 373: 

2 AGCATCAACG AGTGCGTCCG CTTCAGCAAG TACTACCGCC TCAGCCTCAG 
CGTCAACAAG TGCATCGGCT TCAGCGTCAA CGAGTGCGTC TGAGTCAGCA TCAACGAGTG 
CGTCACCTCA GCAAGCACAT CAGCTTCTGA ATCTGCATCA ACCAGTGCGT CACCTCAGCA 
TCGACAAGCG CCTCAGCTTC AGCAAGTACC AGTGCGTCAC CTCAGCGTCG ACAAGTGCGT 
CGGCTTCAGC AAGTACCAGT GCGTCAsCTC AGCAAGTACC AGTGCGTCAC CTCAGCGTCG 
ACAAGTGCGT CGGCCTCAAC CAGTGCATCT GAATCGGCAT CAACCAGTGC GTCACCTCAG 
CAAGTACTAG TGCATCAGCT TCAGCATCAA CGAGTGCATC GGCTTCAGCA TCAACCAGTG 
CATCAGAGTC AGCAAGTACC AGTGCGTCAG cTTCCGCATC AACAAGTGCC TCGGCTTCAG 
CAAGTACTAG CGCCTCAGCC TCAGCGTCAA CAAGTGCTTC AGCTTCCGCG TCAACCAGCG 
CCTCGGCCTC AGCAAGTATC TCAGCGTCTG AATCGGCATC AACAAGTGCC TCGGCTTCAG 
CATCAACGAG TGCATCAGTC TCAGCAAGCA CCAGTGCGTC GGCCTCAGCA AGCACCAGCG 
CGTCTGAATC CGCATCAACC AGTGCCTCAG CTTCAGCAAG TACCTCAGCA TCTGAATCAG 
CATCAACAAG TGCATCGGCT TCAGCAAGCA CAAGTGCTTC AGCCTCAGCA AGTATCTCAG 
CGTCTGAATC GGCATCAACG AGTGCGTCCG CTTCAGCAAG TACTAGCGCC TCAGCATCAG 
CGTCAACAAG TGCTTCGGCT TCAGCGTCAA CGAGTGCGTC TGAGTCAGCA TCAACGAGTA 
CGTCAGCCTC AGCAAGCACA TCAGCTTCTG AATCTGCATC AACCAGTGCG TCAGCCTCAG 
CATCGACAAG CGCCTCAGCT TCAGCAAGTA CCAGTGCGTC AGCCTCAGCA AGTACCAGTG 
CTTCAGCCTC AGCGTCGACA AGTGCGTCGG GCTCAACCAG 1 
(2) INFORMATION FOR SEQ ID NO: 374: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 620 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 374: 

CAGCATCAAC GAGTGCTTCA GTTTCAGCGT CAACCAGTGC CTCTGAATCA GCTTCAACAA 60 

GTGCCTCGGC TTCAGCAAGC CCCAGTGCGT CGGCTTCAGC AAGTACTAGT GCATCGGCTT 12 0 

CAGCATCGAC AAGTGCGTCT GAATCGGCAT CAACGAGTGC TTCGGCTTCA GCATCAACGA 18 0 

GTGCGTCAGC CTCAGCAAGC ACATCAGCTT CTGAATCTGC ATcAACCAGT GCGTCCGyTT 240 
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CAGCGTCAAC CAGTGCGTCG GCTTCAGCGT CGACAAGTGC TTCGGCTTCA GCATCAACGA 3 00 

GTGCGTCGGC CTCAGCAAGC GCAAGTACCT CAGCGTCAGC TTCCGCCTCA ACCAGTGCGT 360 

CGGCTTCAGC AAGCACAAGT GCGTCAGCCT CAGCAAGTAT CTCAGCGTCT GAATCGGCAT 42 0 

CAACGAGTGC GTCTGAGTCA GCATCAACGA GTACGTCAGC CTCAGCAAGC ACATCAGCTT 480 

CTGAATCTGC ATCAACCAGT GCGTCAGCCT CAGCATCGAC AAGCGCCTCA GCTTCAGCAA 540 

GTACCAGTGC TTCAGCCTCA GCGTCGACAA GTGCGTCGGC CTCAACCAGT GCATCTGAAT 600 

CGGCATCAAC CAGTGCGTCA 62 0 
(2) INFORMATION FOR SEQ ID NO: 375: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 720 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 375: 

GTATTGGGGC GCCCCAACCT CTATGTGACT ACGGATTATT TCCTAGATTA CATGgGGATA 60 

AACCATTTAG AAGAATTACC AGTGATTGAT GAGCTTGAGA TTCAAGCCCA AGAAAGCCAA 12 0 

TTATTTGGTG AAAGGATAGA AGAAGATGAG AATCAATAAG TATATTGCCC ACGCAGGTGT 180 

GGCCAGTAGG AGAAAAGCAG AAGAGCTGAT TAAGCAAGGC TTGGTGACGG TTAACGGCCA 240 

AGTGGTGCGT GAACTAGCAA CCACTATCAA GTCAGGCGAC AAGGTCGAAG TTGAAGGTCA 300 

ACCTATCTAC AACGAAGAAA AGGTCTACTA TCTGCTTAAC AAACCACGCG GTGTGATTTC 360 

CAGTGTGACA GATGATAAGG GTCGCAAGAC GGTTGTCGAC CTCTTGCCCA ATGTCAAAGA 42 0 

GCGTATTTAC CCTGTGGGTC GTTTGGACTG GGATACATCA GGTGTCTTGA TTTTGACCAA 480 

TGATGGGGAC TTTACAGACG AGATGATTCA CCCTCGTAAT GAGATTGACA AGGTTTATGT 540 

CGCGCGTGTT AAAGGTGTGG CCAATAAGGA CAATCTCCGC CCCTTGACCC GTGGTCTTGA 600 

GATTGATGGT AAGAAAACCA AGCCATAATA TATAGGTTTT GTAGCCTCTA CACCATAAAT 6 60 

ATTTGCTAAT AAAAATACTG TATTATTACC CTCTTAAGGT GCGAAATTAT TCAAGTTCTT 72 0 
(2) INFORMATION FOR SEQ ID NO: 376: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 648 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 
(□) TOPOLOGY : linear 



WO 98/18931 



PCT/US97/19588 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 376: 

CGCCATTTCC CATCGTACCG CCGAAAATCC CAGCGCCTCA GCCATCAAAT ATCCTATCAA 60 

CGTTCTCAAA AAAAGTGACC GCTCTCTCAT CATGTTTCCA AGTGGTAGCC GCCACTCAAA 12 0 

CGATGTCAAG GGGGGCGCAC ACTskATTGC CAAAATGGCC AAGGTCCGTA TCATGCCGGT 180 

TACCTACACC GGTCCCATGA CTTTGAAGGG CTTGATTAGC CGTGAACGTG TCGATATGAA 240 

CTTTGGAAAT CCAATCGATA TCTCAGATAT CAAGAAAATG AATGATGAAG GCATTGAAAC 3 00 

AGTCGCCAAT CGTATTCAAA CAGAATTCCA ACGTCTGGAC GAAGAAACGA AACAATGGCA 3 60 

CAATGATAAA AAACCAAATC CACTCTGGTG GTTTATCCGC ATCCCTGCCC TCATCCTTGC 42 0 

TATTATCCTC GCTATCCTAA CCATCATCTT TAGCTTTATC GCAACCTTCA TCTGGAACCC 4 80 

AGATAAGAAA AGAGAAGAAC TTGCATAGAA GAAATGAACC TTGGCCAAAC AGCTAAGGTT 54 0 

TTCATTTATA TAGTAGATTG GwACTAGAAT AGTACACCTC TACTTCTAAA ACATTTTTAG 600 

AAATCGATTT GACTGTCCTG ATCGATTTGT CCTAATCTTA TTTCAATT 64 8 
(2) INFORMATION FOR SEQ ID NO: 377: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 690 base pairs 

(B) TYPE : nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY : linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 377: 

GTGCATCGCT TTCAGCATCG ACAAGTGCGT CTGAATCGGC ATCAACGAGT GCTTCGGCTT 60 

CAGCATCAAC GAGTGCGTCA GCTTCAGCAA GCACATCAGC TTCTGAATCT GCATCAACCA 120 

GTGCGTCCGC TTCAGCGTCA ACCAGTGCGT CGGCTTCAGC GTCGACAAGT GCTTCGGCTT 180 

CAGCATCAAC GAGTGCGTCG GCCTCAGCAA GCGCAAGTAC CTCAGCGTCA GCTTCCGCCT 240 

CAACCAGTGC GTCCGCTTCA GCAAGCACAA GTGCGTCAGC CTCAGCAAGT ATCTCAGCGT 3 00 

CTGAATCGGC ATCAACGAGT GCGTCGGCCT CAGCAAGCGC AAGTACCTCA GCGTCAGCTT 3 60 

CCGCCTCAAC CAGTGCGTCG GCTTCAGCAA GCACAAGTGC GTCAGCCTCA GCAAGTATCT 42 0 

CAGCGTCTGA ATCGGCATCA ACGAGTGCGT CTGAGTCAGC ATCAACGAGT ACGTCAGCCT 4 80 

CAGCAAGCAC ATCAGCTTCT GAATCGGCAT CAACCAGTGC GTCAGCCTCA GCATCGACAA 54 0 

GCGCCTCAGC TTCAGCAAGT ACCAGTGCTT CAGCCTCAGC GTCGACAAGT GCGTCGGCCT 600 

CAACCAGTGC ATCTGAATCG GCATCAACCA GTGCGTCAGC CTCAGCAAGT ACTAGTGCAT 660 
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1 GCATCGGCTT 
INFORMATION FOR SEQ ID NO: 378: 

<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1003 base pairs 

(B) TYPE : nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 37 8: 
CGAGATTCTC TGGAGTTATG GATGTCGTTC CAATATGTGC ACGTTGGAAT GTTAGTGCTT 
ATATGGGGGG AACAGAATCC TCTCTTGATT GAAGACAAGC TAGTCATTAG GCTGGTTTGT 
CTTTTTGTCA ACTGTAGTGG GTTGATATAA TAGTATTAGT GAGTGGGATA AAAGTTTCAT 
TTAGTTTATT CAGTACAAAT TTAACGGGTC AAGATTTATA TACTAGTGGT GTTTTTGGGG 
CTGAGAGAAG TATCTTGATT TTATGTGTGG TTTTTATACT TACAGTTGTT CTGCTCCAAA 
GAGCTTGTAG AGAAGAATTA GCTCATAAAG GAGATTGATT ATTTTGATAT CAAAAAAATG 
CACAGGATAA CCTGATGCAT TTTTTTAGCG ACAATGCTTG CTACTTCCTT CTGTCGAATT 
TAGACAATTT TAAACCCCAA TTATTCACCC CAAATCTAAA AACCATCCAG AATCCTTGCC 
TTAGCTTAGA TCCTGGATGG TTTCTTTTTT CACCCAATGG GTGTTTTTTA CTAGACAAAA 
AAGAGTTTCC CCTTTATGGT ATAAGTGTAG AAAAAAACAC AAAAAGAAAG GAAACTCACA 
TGAACAGTTT ACCAAATCAT CACTTCCAAA ACAAGTCTTT TTACCAACTA TCTTTCGATG 
GAGGTCATTT AACCCAGTAT GGTGGTCTTA TCTTTTTTCA GGAACTTTTT TCCCAGTTGA 
AACTAAAAGA GCGGATTTCT AAGTATTTAG TAACGAATGA CCAACGCCGC TACTGTCGTT 
ATTCGGATTC AGATATCCTT GTCCAGTTCC TCTTTCAACT GTTAACAGGT TATGGAACGG 
ACTATGCTTG TAAAGAATTG TCAGCTGATG CCTACTTTCC AAAATTATTG GAAGGAGGGC 
AGCTTGCTTC ACAGCCAACC TTATCCCGTT TTCTTTCCAG AACTGACGAG GAAACAGTCC 
ATAGTTTGCG ATGCCTCAAC CTTGAATTGG TCGAATTCTT TTT 
(2) INFORMATION FOR SEQ ID NO: 379: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 73 8 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



SEQUENCE DESCRIPTION: SEQ ID NO: 379: 
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CCGATGATTC TGATTGGTTT GCTCTTTACT TTGCTGGGAA TTTTGAGGTA GATCTATGAT 60 

TGAAATACTA ATTGTTTTAG CTATTATCCT ATCTCTTGCT TTGATTGTAT TGGTAACTAT 120 

ACAACCCCGT CAAAATCAAC TATTTTCCAT GGATGCCACT AGTAATATTG GTAAACCAAG 180 

CTACTGGCAG AGCAACACCT TGGTCAAGGT GCTCACTTTA TTGGTGAGTT TGGCTTTATT 240 

TATTCTACTA TTAACCTTTA TGGTGATTAC TTATAAATAA AAGAAAACTT CAGATATTCA 300 

CCTTTTGTGG ATTGGTCTGA AGTTTTCTTT TTTATACTCA ATGAAAATCA AAGAGCAAAC 3 60 

TAGGAAGCTA GCCGCAckGC TCAAAACACC GTTTTGAGGT TGTAGATATA ACTGACGAGc 42 0 

GACTCAAAAC ACCGTTTTGA GGTTGTAGAT ATAACTGACG AGcGACTCAA AACACCGTTT 4 80 

TGAGGTTGTG GATAGAACTG ACGAGcGACT CAAAACACCG TTTTGAGGTT GTGGATAGAA 54 0 

CTGACGAAGT CGcTCAAAAC ACCGTTTTGA GGTTGTGGAT AGAACTGACG AAtgctCAAA 600 

ACACCGTTTT GAGGTTGTGG ATAGAACTGA CGAAGCgaaC ATATATACAG CAAGGCGACG 6 60 

CTGACGTGGT TTGAAGAGTA TTACTGTCTA TATTTTTGCT AAAAATCAAC TTTTACTTGG 720 

ATGAAGGTTT TTTTTTTT 73 8 
(2) INFORMATION FOR SEQ ID NO: 3 80: 

li) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 695 base pairs 

(B) TYPE: nucleic acid 

(!") STKANDEDMESS: double 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 380: 

CCGTCTTATC AAAGAGGTTA ACAAAGGCAC CAAATTTCTC GATACGAACG ACTTTAGCAC 60 

GGTAAACTTC ATCCACTTTG GCTTCACGAA CCAAACCAGC AATAATTTCT TTGGCACGGT 12 0 

TAATAGCATC TTGGTCACTA GAGTAGATAG ACACATTTCC TTCTTCGTCT ATATCAATCT 180 

TAACACCTGT TTCAGCGATA ATCTTGTCGA TGGTTTCTCC ACCCTTACCG ATGACAATCT 240 

TAATCTTGTC CACATCAATC TTGATCGTAT CAATTTTCGG AGCAGTTGGA GCCAATTCTG 300 

GACGAACTTC TGGAATGGTT GCTTCAATGA CATCAAGGAT TTCAAAACGC GCTTTCTTGG 3 60 

CTTGAGCAAG AGCCTCCGTC AAGATTTCTG CAGTAATCCC TTGAATCTTG ATATCCATTT 42 0 

GAAGGGCTGT AAT C C CAT C A CGAGTACCTG CAACCTTGAA GTCCATATCT CCAAAGTGAT 48 0 

CTTCCAAACC TTGGATATCT GTCAATACTG TGTAGTTATT TCCATCTGAG ATAAGCCCCA 54 0 

TAGCAATACC AGCTACTGGC GCCTTGATTG GCACACCACC AGCCATAAGG GCAAGAGTTC 600 
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CCGCACAGAT AGAAGCTTGA GATGAAGAAC CGTTTGATTC CAAAACTTCT GCTACTAGAC 660 
GGATAGCGTA GGGGAATTCT TCCAAGCTTG GCAGG 695 
(2) INFORMATION FOR SEQ ID NO : 3 81: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 691 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY : linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO; 381: 

GACATCTTAT CTAAATACAT GCTAATATAT TTAGATACAA ACATTCCAAC TTGATAATTT 60 

TCACTCATCT TTCATCATTC CTTATACAAC TATGCAGTAT AAATAGAATA GTTTTCTCAT 120 

CAGAATGAGA CTATTTTAAT ATTAGATCCC CAATTATTCA CCCCAAATCT AAAAAC CAT C 180 

CAGAATCCTT GCCTTAGCTT AGATCCTGGA TGGTTTCTTT TTTCACCCAA TGGGTGTTTT 240 

TTACTAGACA AAAAAGAGTT TCCCCTTTAT GGTATAAGTG TAGAAAAAAA CACAAAAAGA 300 

AAGGAAACTC ACATGAACAG TTTACCAAAT CATCACTTCC AAAACAAGTC TTTTTACCAA 3 60 

CTATCTTTCG ATGGAGGTCA TTTAACCCAG TATGGTGGTC TTATCTTTTT TCAGGAACTT 420 

TTTTCCCAGT TGAAACTAAA AGAGCGGATT TCTAAGTATT TAGTAACGAA TGACCAACGC 480 

CGCTACTGTC GTTATTCGGA TTCAGATATC CTTGTCCAGT TCCTCTTTCA ACTGTTAACA 540 

GGTTATGGAA CGGACTATGC TTGTAAAGAA TTGTCAGCTG ATGCCTACTT TCCAAAATTG 600 

TTGGAAGGAG GGCAGCTTGc TTCACAGCCA ACCTTATCCC GwTTTCTTTC CAGAACTGAC 6 60 

GAGGAAACAG TCCATAGTTT GCGATGCCTC A 691 
(2) INFORMATION FOR SEQ ID NO: 382: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 750 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 382: 

ATCTCTCTGC GTAATGGTCC TCAGATAACT CTGATGATGT GTGGCGATAT AGAACTGAGC 60 

CAAGTTATGC CTAAAGGGCC TTAGGAATAG GAGCTTTCAC AAGCTTATCC AGATGATTAT 12 0 

CTTTTACTCG TTATGGACAA TGCTATATGG CATAAATCAA GTACCTTAAA GATTCCGACT 180 

AATATTGGCT TTGCATTTAT TCCTCCATAC ACACCAGAGA TGAACCCCAT TGAACAAGTG 24 0 
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TGGAAAGAGA TTCGTAAACG TGGATTTAAG AATAAAGCCT TTCGAACTTT GGAAGATGTC 3 00 

ATACAAGGAC TGGAGAAGGA GGTGATAAAG TCCATCGTTA ATCGGAGACG GACTAGAATG 3 60 

CTTTTTGAAA ACAGATGAGT ATAAAAAGAA AGTCCTCATT TCAATAGAAA TCACGACTTT 420 

CTGATGAATT TATAGTAAAA TGAAATAAGA ACAGGATAGT CAAATCGATT TCTAACAATG 480 

TTTTAGAAGC AGAGGTGTAC TATTCTAGTT TAAATCCACT ATATTTGGGG AGTGATAGAA 540 

AAGCCCTTCA TCAGCCAATC TACTTGTTCA GGTGCGAGAG CTTTGACATC CTTTTCTGTA 600 

CTGGACCAAG TCAGTTTTCC GTTCTCAAAG CGTTTATATA ATATCCAAAA TCCTTGACCA 660 

TCCCAGTAAA GAACTTTAAA GCGGTCTTTA CGTCCACCAC AAAAGAGAAA GACTTGATCG 72 0 

GAGAAAGGAT CCAATTCAAA GTGGGTTTGG 750 
(2) INFORMATION FOR SEQ ID NO: 383: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 738 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 383: 

TCAAATTCTT CGTGGTCCGC ATATCTnTCT TCGTACACGG CAGTCACTTG GTCTTTCACT 60 

ACTCGAGTCG CAGCTTCACG GGCCAATTTC TCTTCTACTT GAACTGCCTT TTGGAGGTCA 120 

CTGTTGTAGG CTGCAATGAT TTCAGCTTGC AATTCAGCAT CCACGTGAAG CAATTCCACT 180 

TCTGCTTTTT CTTTACCGAC AGCAGCAACG ATTTCTTCTT GGAAGGCAAT CAATTCTTTG 240 

ACAGCTTCGT GCCCTTTAAG GAGCGCTTCC AACATGATTT CTTCTGACAA TTCTTTGGCA 300 

CCAGACTCTA CCATGTTGAT AGCGTGCTTG GTTCCAGCTA CTGTCAATTC AAGAAGAGAT 3 60 

TGCTCTGCTT GTTCTTGACT TGGGTTGATG ATGATTTGGC CATCTACATA TCCCACTTGT 420 

ACCCCAGCAA TTGGTCCGTC AAATGGAATA TCTGAAATAG ACAGTGCCAA AGATGAACCA 480 

AACATAGCAG CCATTGGTGC AGATGCATTT TCATCATAAG AAAGCACTGT ATTGATGACT 540 

TGGACTTCAT TACGGAAACC TTCCGCAAAC ATAGGACGAA TCGGACGGTC AATCAAACGC 600 

GCTGTCAAGG TCGCATCTGT TGAAGGACGT CCTTCACGTT TCATAAAGCC ACCAGGAAAC 560 

TTCCCAGCCG CATACATTTT TTCTTCGTAG TTGACTTGGA GTGGGAAGAA ATCCTCAGTT 720 

GCCATTTTCT GGGGATCC 738 
(2) INFORMATION FOR SEQ ID NO: 384: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 657 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 384: 

CCCCCTATTT ACCGTGGACT AAAGTTGTAC AAGAAAAGTG CAAATAAGAA ATCTCCAGAT 60 

TAGGAACTAT ATATGAGTTC TCTAGTCTGG AGATTTTTCA ATAGACTTCG TTATTGGGCG 120 

GTTACTTTCG AAACTTTGAA AACTTCAAAA AACGGATTTT TATCGCTTTC AAATTCTTTT 180 

GGGGTCAAAC TCAGTAACTT ATTCGCCTTG TAGACTTCAT GACGCTCAGG GTATACTTTC 240 

AAGGTCCCAA ATAGCCAAGA ATCGTCAGCG ATATTATCTG AATCATCTCC TTCTTGTTCT 300 

CCTTTAGTTC GCCTGAGGAC AGCCTTGACA CGCGCCAGAA TTCTCTAGGG CTAAAAGGCT 3 60 

TGGTCAGGTA GTCATCAGCC CCTAATTCCA AGGCCAAAAC CTTATCAAAT TCATCACTTT 420 

TCGCAGAAAC CATCATAATT GGAGTTTTGA CGCCTTTGGC TCTCAGCCGC TTACAAACTT 4 80 

CCATGCCATC TAATTGTGGT AACATGATAT CAAGCAAGAT AAAATCAAAG GGTTCTGTTT 540 

CTGCCAAAGC TAAGGCCTTC CGTCCATTTG TCACCAATTG AGTAGAAAAG CCTTCCTTAC 600 

TTAAATGGTA GTCAAGCAAT TTCAGAATGT GTTCTTCATC ATCCACTAAT AAGACTT 657 
(2) INFORMATION FOR SEQ ID NO: 385: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 586 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 385: 

CCGCATCAGC ATCAACGAGT GCATCGGCTT CACGTCAACC AGTGCATCAG TCTCAGCAAG 6 0 

CACCAGTGCG TCGGCTTCAG CATCAACGAG TGCCTCAGCC TCAGCAAGTA TCTCAGCGTC 12 0 

TGAATCGGCA TCAACGAGTG CGTCAGCTCA GCAAGTACTA GTGCATCGGC TTCAGCAAGC 180 

ACCAGTGCGT CGGCTTCAGC ATCAACCAGT GCCTCAGCCT CAGCAAGTAT CTCAGCGTCT 240 

GAATCGGCAT CAACGAGTGC GTCACCTCAG CAAGTACTAG TGCATCAGCA TCAGCATCAA 300 

CGAGTGCATC GGCTTCAGCA AGTACCAGCG CCTCAGCTTC AGCAAGCACC AGTGCGTCAC 360 

CTCAGCAAGT ACCAGCGCCT CAGCCTCAGC AAGCACCAGT GCCTCAGCTT CAGCAAGTAC 42 0 

CAGTGCGTCA CCTCAGCATC GACAAGTGCG TCGGCTTCAG CAAGTACCTC AGCGTCTGAA 480 
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TCAGCATCAA CGAGTGCGTC AGCTTCAGCA TCAACCAGTG CCTCAGCCTC AGCAAGTATC 540 
AGTGCGTCAG CTTCAGCATC AACGAGTGCG TCAGCTGCAG CAAGTA 586 
(2) INFORMATION FOR SEQ ID NO: 386: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 451 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY : linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 386: 
CGTCGGCTTC AGCATCAACG AGTGCATCAG CTTCAGCATC AACAAGTGCT TCAGCTTCAG 
CAAGTACCAG TGCGTCGGCT TCAGCATCAA CGAGTGCTTC AGTCTCAGCG TCAACCAGTG 
CCTCTGAATC CGCATCAACA AGTGCCTCGG CTTCAGCAAG CACCAGTGCT TCGGCTTCAG 
CGTCAACGAG TGCGTCTGAG TCAGCATCAA CGAGTGCGTC ACCTCAGCAA GCACATCAGC 
TTCTGAATCT GCATCAACCA GTGCGTCAGC TTCCGCATCA ACAAGCGCCT CGGCCTCAGC 
AAGTACAAGT GCTTCAGCCT CAGCATCAAC CAGTGCATCA GCTTCAGCCT CAACAAGTGC 
TTCAGCCTCA GCGTCAACCA GTGCCTCGGC TTCAGCAAGT ACCAGTGCGT CAGTTcAGCA 
AGCACAAGTG CGTCAATTTA GCATCAACCA G 
(2) INFORMATION FOR SEQ ID NO: 387: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 425 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 387: 

TCTCAGCAAG CACCATTGCG TCGGCTTCAT CAAGCACCAG CGCGTTTGAA TCCGCATCAA 60 

CCAGTGCTTC AGCTTCAGCC AAGTTACCTC AGCATCTGAA TCAGCATCAA CAAGTGCATC 120 

GGCTTCAGCA AGCACAAGTG CTTCAGCtCA GCAAGTATCT CAGCGTCTGA ATCGGCATCA 180 

ACGAGTGCGT CCGCTTCAGC AAGTACTAGC GCCTCAGCAT CAGCGTCAAC AAGTGCTTCG 240 

GCTTCAGCGT CAACGAGTGC GTCTGAGTCA GCATCAACGA GTACGTCAGC CTCAGCAAGC 300 

ACATCAGCTT CTGAATCTGC ATCAACCAGT GCGTCAGCCT CAGCATCGAC AAGCGCCTCA 360 

GCTTCAGCAA GTACCAGTGC GTCAGCCTCA GCAAGTACCA GTGCTTCAGC CTCAGCGTCG 42 0 



60 
120 
180 
240 
300 
360 
420 
451 
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ACAAG 

(2) INFORMATION FOR SEQ ID NO: 




(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 572 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY : linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 388: 

AGAGGATCCC CGGATCCTCA GTCGCTGAGA TAACTCCTTT GGGCTTGTTC ATCATGTAGT 60 

AG AC AAAC T C TTCATACTCC AACACTTGCC CATTTTATGC GAATCTCATC TATTTTTTCT 120 

TTTTTTTGCA ATTTAGCTGA TTTTTCTTTT TTACCATTTA CAGTCACGCG CCCAGCCTTG 180 

AGCAAGTTTT TGACCTCAGT CCGACTTCCC ACCGCACAGG CAACTAAAAA TTTATCTAAT 240 

CTCATAGAAC TATTATATCA TATCAAAAGG AGGCTAGTAC AATGACCAAC CTCCTTTTCG 300 

TTTCATACTC TTCAAAAATC TCTTCAAACC GCGTCAACGT CGCCTTGCCG TATATATGTT 3 60 

ACTGACTTCG TCAGTTCTAT CTGCAACCTC AAAACAGTGT TTTGAGCTGA CTTCGTCAGT 420 

TCTATCTGCA ACCTCAAAGC AGTGCTTTGA GCATCCTGCG GCTAGTTTCC kAGTkTGCTC 4 80 

TTTGATTTwC ATTGAGTATC AGATTTAGGA AATTAACTTC CTCGlcCTCCA AAAAAkAGCT 540 

AAAACAATCA AGGCTCCTAA AATCGCTGGG AT 572 
(2) INFORMATION FOR SEQ ID NO: 389: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 505 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 389: 

CAACAAGTGC CTCGGCTTCA GCATGCACAA GTGCTTCAGC TTCAGCATGT ACCTGAGCGT 60 

CTGAATCAGC ATCAACGTGT GCGTCCGCTT CAGCATGTAC TGCTGCCTCA GCATCAGCGT 120 

CAACAwGTGC TTCGGCTTCA GCGTCAACGA GTGCGTCTGA GTCAGCATCA ACGAGTACGT 180 

CAGCCTCAGC AAGCACATCA GCTTCTGAAT CTGCATCAAC CAGTGCGTCA GCCTCAGCAT 240 

CGACAAGCGC CTCAGCTTCA GCAAGTACCA GTGCGTCAGC CTCAGCAAGT ACCAGTGCTT 300 

CAGCCTCAGC GTCGACAAGT GCGTCGGCCT CAACCAGTGC ATCTGAATCG GCATCAACCA 3 60 

GTGCGTCAGC CTCAGCAAGT ACTAGCGCCT CAGCCTCAGC ATCAACGAGT GCGTCCGCTT 420 
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CAGCAAGTAC TAGTGCATCA GCATCAGCAT CAACGAGTGC ATCGGCTTCA GCAAGTACCA 
GCGCCTCAGC TTCAGCAAGC ACCGG 
(2) INFORMATION FOR SEQ ID NO: 390: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 447 base pairs 

(B) TYPE: nucleic acid 
(C) 
(D) 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 390: 

GCTAAGACTA CCTCATTAGG GGCATAGGCT GCTAAAATAA CTGCAGCTGT GGTTAATGAC 60 

AATACTGTAC TTTTTTTCAT TTTAATTCCT TACATATTTA TATAACTTCC AATAGATAAT 12 0 

AAACTTTAAC TTTGCTAGCC TTTGTTATAA AAAGTTTTAC TAAGTATTAT CTAGGAAATA 180 

GAGTAGTACA TTTATATATA ATTGTTATCT CTCTATAAAA ACAGTATATC ATTTAAAAAA 240 

ATTTAAGTCA AAAAAATTAA CATTAGTTAA TTTATTTTTT AGCACACATT AAAAAATAAG 3 00 

ATTAGTACTC AATGAAAATC AAAGAGCAAA CTAGGAAACT AGCCGCAGAT TGCTCAAAAC 3 60 

AGTGTTTTGA GGTTGTAGAT GGAATGACGT AGTCAGCTCA AAACACTGTT TTGAAGTTGT 420 

GGATAGAACT GACGAAGTCG GTACCGA 447 
(2) INFORMATION FOR SEQ ID NO: 391: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 572 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY ; linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 391: 

AGCACTTGTC GTTGAATTCT ACAACAAAAT GTTGTAATAT TTTATTGAAT AAGATAGGCC 60 

TTGATATTAA GCACTTTGGG ACGTTCTCCC TTAGTGCTTT TTTGATTTCT CTTAGTATCC 12 0 

AGCTATAATC GTTGAGACAT AACTAGACCG ATATAGTCCA AAGTGATATA GTAAAATGAA 180 

CCAAAAATAG TACACAATGT GGTATAATCC TTTTATGGCA TATTCAATAG ATTTTCGTAA 240 

AAAAGTTCTC TCTTATTGTG AGCGAACAGG TAGTATAACA GAAGCATCAC ACGTTTTCCA 3 00 

AATCTCACGT AATACCATTT ATGGCTGGTT AAAGCTAAAA GAGAAAACAG GAGAGCTAAA 3 60 

CCACCAAGTA TAGTGTATTG AATCTATAAC AGTACACCTT GGCTGCTAAA ATATTTCTAT 420 
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AAATTAATTT GACTTTCCTG AT AG AG AT GT TCACATCTTA TTTCAAACTA CTATATAAGT 4 80 

TCTATAATCT CTTTATAAGA TTTGCCCATC AGACAAAATA GAACGATTTG AAGGCGTTTA 540 

TGATATTTAG CTGTACGAGA GTCTTTTAAA AG 572 
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DENMARK 

The applicant hereby requests that, until the application has been laid open to public inspection 
(by the Danish Patent Office), or has been finally decided upon by the Danish Patent Office 
without having been laid open to public inspection, the furnishing of a sample shall only be 
effected to an expert in the art. The request to this effect shall be filed by the applicant with the 
Danish Patent Office not later than at the time when the application is made available to the 
public under Sections 22 and 33(3) of the Danish Patents Act. If such a request has been filed by 
the applicant, any request made by a third party for the furnishing of a sample shall indicate the 
expert to be used. That expert may be any person entered on a list of recognized experts drawn 
up by the Danish Patent Office or any person approved by the applicant in the individual case. 

SWEDEN 

The applicant hereby requests that, until the application has been laid open to public inspection 
(by the Swedish Patent Office), or has been finally decided upon by the Swedish Patent Office 
without having been laid open to public inspection, the furnishing of a sample shall only be 
effected to an expert in the aft. The request to this effect shall be filed by the applicant with the 
International Bureau before the expiration of 16 months from the priority date (preferably on the 
Form PUT/RO/134 reproduced in annex Z of Volume I of the PCT Applicant's Guide). If such a 
request has been filed by the applicant, any request has been filed by the applicant, any request 
made by a third party for the furnishing of a sample shall indicate the expert to be used. That 
expert may be any person entered on a list of recognized experts drawn up by the Swedish Patent 
Office or any person approved by the applicant in the individual case. 

UNITED KINGDOM 

The applicant hereby requests that the furnishing of a sample of a microorganism shall only be 
made available to an expert. The request to this effect must be filed by the applicant with the 
International Bureau before the completion of the technical preparations for the International 
publication of the application. 

NETHERLANDS 

The applicant hereby requests that until the date of a grant of a Netherlands patent or until the 
date on which the application is refused or withdrawn or lapse, the microorganism shall be made 
available as provided in Rule 3 1F(1) of the Patent Rules only by the issue of a sample to an 
expert. The request to this effect must be furnished by the applicant with the Netherlands 
Industrial Property Office before the date on which the application is made available to the 
public under Section 22C or Section 25 of the Patents Act of the Kingdom of the Netherlands, 
whichever two dates occurs earlier. 
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SINGAPORE 

The applicant hereby requests that the furnishing of a sample of a microorganism shall only be 
made available to an expert. The request to this effect must be filed by the applicant with the 
International Bureau before the completion of the technical preparations for international 
publication of the application. 

NORWAY 

The applicant hereby requests that, until the application has been laid open to public inspection 
(by the Norwegian Patent Office), or has been finally decided upon by the Norwegian Patent 
Office without having been laid open to public inspection, the furnishing of a sample shall only 
be effected to an expert in the art. The request to this effect shall be filed by the applicant with 
the Norwegian Patent Office not later than at the time when the application is made available to 
the public under Sections 22 and 33(3) of the Norwegians Patents Act. If such a request has been 
filed by the applicant, any request made by a third party for the furnishing of a sample shall 
indicate the expert to be used. That expert may be any person entered on a list of recognized 
experts drawn up by the Norwegian Patent Office or any person approved by the applicant in the 
individual case. 

AUSTRALIA 

The applicant hereby gives notice that the furnishing of a sample of a microorganism shall only 
be effected prior to the grant of a patent, or prior to the lapsing, refusal or withdrawal of the 
application, to a person who is a skilled addressee without an interest in the invention 
(Regulation 3.25(3) of the Australian Patents Regulations). 

FINLAND 

The applicant hereby requests that, until the application has been laid open to public inspection 
(by the National Board of Patents and Registration), or has been finally decided upon by the 
National Board of Patents and Registration without having been laid open to public inspection, 
the furnishing of a sample shall only be effected to an expert in the art. 

ICELAND 

The applicant hereby requests that, until the application has been laid open to public inspection 
(by the Icelandic Patent Office), or has been finally decided upon by the Icelandic Patent Office 
without having been laid open to public inspection, the furnishing of a sample shall only be 
effected in the art. 



WO 98/18931 



PCT/US97/19588 



1402 



What Is Claimed Is: 

1. Computer readable medium having recorded thereon the nucleotide 
sequence depicted in SEQ ID NOS: 1-391, a representative fragment thereof or a 
nucleotide sequence at least 95% identical to a nucleotide sequence depicted in SEQ 
ID NOS: 1-391. 

2. Computer readable medium having recorded thereon any one of the 
fragments of SEQ ID NOS: 1-391 depicted in Tables 2 and 3 or a degenerate variant 
thereof. 

3. The computer readable medium of claim 1, wherein said medium is 
selected from the group consisting of a floppy disc, a hard disc, random access 
memory (RAM), read only memory (ROM), and CD-ROM. 

4. The computer readable medium of claim 3, wherein said medium is 
selected from the group consisting of a floppy disc, a hard disc, random access 
memory (RAM), read only memory (ROM), and CD-ROM. 

5. A computer-based system for identifying fragments of the Streptococcus 
pneumoniae genome of commercial importance comprising the following elements: 

a) a data storage means comprising the nucleotide sequence of SEQ ID 
NOS: 1-391, a representative fragment thereof, or a nucleotide sequence at least 
95% identical to a nucleotide sequence of SEQ ID NOS: 1-391; 

b) search means for comparing a target sequence to the nucleotide sequence 
of the data storage means of step (a) to identify homologous sequence(s), and 

c) retrieval means for obtaining said homologous sequence(s) of step (b). 

6. A method for identifying commercially important nucleic acid fragments 
of the Streptococcus pneumoniae genome comprising the step of comparing a 
database comprising the nucleotide sequences depicted in SEQ ID NOS: 1-391, a 
representative fragment thereof, or a nucleotide sequence at least 95% identical to a 
nucleotide sequence of SEQ ID NOS:l-391 with a target sequence to obtain a 
nucleic acid molecule comprised of a complementary nucleotide sequence to said 
target sequence, wherein said target sequence is not randomly selected. 
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7. A method for identifying an expression modulating fragment of 
Streptococcus pneumoniae genome comprising the step of comparing a database 
comprising the nucleotide sequences depicted in SEQ ID NOS: 1-391, a 
representative fragment thereof, or a nucleotide sequence at least 95% identical to 
the nucleotide sequence of SEQ ID NOS: 1-391 with a target sequence to obtain a 
nucleic acid molecule comprised of a complementary nucleotide sequence to said 
target sequence, wherein said target sequence comprises sequences known to 
regulate gene expression. 

8. An isolated protein-encoding nucleic acid fragment of the Streptococcus 
pneumoniae genome, wherein said fragment consists of the nucleotide sequence of 
any one of the fragments of SEQ ID NOS: 1-391 depicted in Tables 2 and 3, or a 
degenerate variant thereof. 

9. A vector comprising any one of the fragments of the Streptococcus 
pneumoniae genome SEQ ID NOS: 1-391 depicted in Tables 2 and 3 or a 
degenerate variant thereof. 

10. An isolated fragment of the Streptococcus pneumoniae genome, 
wherein said fragment modulates the expression of an operably linked open reading 
frame, wherein said fragment consists of the nucleotide sequence from about 10 to 
200 bases in length which is 5' to any one of the open reading frames depicted in 
Tables 2 and 3 or a degenerate variant thereof. 

11. A vector comprising any one of the fragments of the Streptococcus 
pneumoniae genome of claim 8. 

12. An organism which has been altered to contain any one of the 
fragments of the Streptococcus pneumoniae genome of claim 8. 

13. An organism which has been altered to contain any one of the 
fragments of the Streptococcus pneumoniae genome of claim 10. 
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14. A method for regulating the expression of a nucleic acid molecule 
comprising the step of covalenlly attaching to said nucleic acid molecule a nucleic 
acid molecule consisting of the nucleotide sequence from about 10 to 100 bases 5' 

95 to any one of the fragments of the Streptococcus pneumoniae genome depicted in 

SEQ ID NOS:l-391 and Tables 2 and 3 or a degenerate variant thereof. 

15. An isolated nucleic acid molecule encoding a homolog of any of the 
fragments of the Streptococcus pneumoniae genome of SEQ ID NOS: 1-391 and 

100 Tables 2 and 3, wherein said nucleic acid molecule is produced by a process 

comprising steps of: 

a) screening a genomic DNA library using as a probe a target sequence 
defined by any of SEQ ID NOS: 1-391 and Tables 2 and 3, including fragments 
thereof; 

105 b) identifying members of said library which contain sequences that 

hybridize to said target sequence; and 

c) isolating the nucleic acid molecules from said members identified in step 

(b). 

no 

16. An isolated DNA molecule encoding a homolog of any one of the 
fragments of the Streptococcus pneumoniae genome of SEQ ID NOS: 1-391 and 
Tables 2 and 3, wherein said nucleic acid molecule is produced a process 
comprising steps of: 

1 15 a) isolating mRNA, DNA, or cDNA produced from an organism; 

b) amplifying nucleic acid molecules whose nucleotide sequence is 
homologous to amplification primers derived from said fragment of said 
Streptococcus pneumoniae genome to prime said amplification; 

c) isolating said amplified sequences produced in step (b). 

120 

17. An isolated polypeptide encoded by any of the fragments of the 
Streptococcus pneumoniae genome of SEQ ID NOS: 1-391 and depicted in Table 2 
and 3 or by a degenerate variant of said fragments. 



125 18. An isolated polynucleotide molecule encoding any one of the 

polypeptides of claim 17. 
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19. An antibody which selectively binds to any one of the polypeptides of 
claim 17. 

130 

20. A method for producing a polypeptide in a host cell comprising the 
steps of: 

a) incubating a host containing a heterologous nucleic acid molecule whose 
nucleotide sequence consists of any one of the fragments of the Streptococcus 

135 pneumoniae genome of SEQ ID NOS: 1-391 and depicted in Tables 2 and 3, under 

conditions where said heterologous nucleic acid molecule is expressed to produce 
said protein, and 

b) isolating said protein. 



1/2 



Fisure 1 



PCT/US97/19588 




WO 98/18931 



2/2 



PCT/US97/19588 



Figure 2 



<0 2 JN 
CC Q 



CO 2<o 
c 

O CO , 

= — J 
O CD 



en 

■a a) w 

Sfl 



a xi 

O CD 

c Q 

=3 a 

co £^ 

^ LL =d 

1 

> 



* J 




CD 


Write Dc 

S.pneuman 

[lassie] 


f 

MBLY 




jence retrieval 
ambiguity filter 
;eq, seqjilter] 


ASSEI 


Rapid A 
and Ore 
lousands c 

[asm_ 




1- 


cr -a i: 

<D C >< 






CO co ^ 







